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TITLE: Protein Identification Methods and Systems 
FIELD OF THF. INVENTION 

The invention relates to methods and systems for identifying proteins. 
BACKGROTTMn r»F THE mw.NTinu 

5 Database searching for peptide identification using mass spectrometry data as queries is now 

commonplace. However, an ongoing problem in mass spectrometry is the time it takes to search 
^annotated genomic DNA sequences with MS/MS peptide information, especially with large amounts of 
data as found m LC/MS/MS run, Choudhary et al. (Proteomics 2001:651-667) reported L useT thl 
genome as a database but the technique suffered from long search times. They reported search times of 10 
10 hours on a smgle 600 MHz Intel CPU for ,69 MS/MS spectra (about 3.5 minutes per spectrum). This is 

onger than the acquisition time. Parallelization of any search software on a Beowulf cluster requires 
doublmg the amount of computers each time to cut the search time in half. Thus, there is a need for fast 
and efficient methods and systems for identifying proteins from mass spectrometry peptide 

15 artt * r te * 0n0f " yrefW ^^^ 
15 art to the instant invention 

SUMMARY OF T HE INVttNTTniNI 

The present inventors have deve.oped a new approach to protein identification. The approach 
enables de novo protein sequencing of a genome in a very fast and cost effective manner. In particular the 

20 m !r e Se T mg St6PS ^ 6,131 ^ ° rderinS PhaSe — spectrometry sequencing 

20 methods can be avoided allowing the sequencing speeds and overall m ass spectrometry through put to b 

to be analyzed to obtam the full protein sequence. Thus, the methods and systems can use small quantities 
of protems smce only a few peptides need to be analyzed, m addition, the methods and systems by 
generatmg a list of peptide masses for the full protein sequence make it easier to distinguish true proteins 
15 .n the sample and artifacts generated by noise from contaminant protcin , ' 

In an aspect the approach utilizes mass spectre-metric techniques and a hardware-based searching 
algonthm. Tins system is capable of locating peptide queries (interpreted mass spectrometry data) in a 
genome and scoring each matching location based on the uninterpreted data from the mass spectrometer 

ThUS ' thC ta ~*" P™*** a method for identifying a protein through amino acid sequences of 
>0 oneormorequerypeptidesgeueratedfromtheproteincomprising: 

(a) translating amino acid sequences of one or more query peptides to all possible codons 
from which the peptides can be synthesized to prepare strings of codons- 

(b) searching known nucleic acid sequences, in particular a set of known nucleic acid 
sequences including a genome, to locate one or more known nucleic acids that comprise 
regions that match the strings of codons; and 

(c) optionally ranking two or more matching nucleic acids to identify nucleic acids that are 
true coding regions for the protein to thereby identify the protein 

In a particular aspect the invention provides a method for identifying a protein comprising- 
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(a) 
(b) 



providing amino acid sequences of query peptides generated by mass spectrometry 
of peptides cleaved from the protein; 

plating amino acid sequences of one or more query peptides to all possible 
codons from which the peptides can be synthesized to prepare strings of codons- 

(c) searching known nucleic acid sequences, in particular a set of known nucleic' acid 
sequences including a genome, to locate one or more known nucleic acids that 
comprise regions that match the strings of codons; and 

(d) optionally ranking two or more matching nucleic acids to identify nucleic acids that 
are true coding regions for the protein to thereby identify the protein 

In an embodiment of the invention, the strings of codons are provided as simultaneous parallel 

In a still further embodiment, the method allows unknown amino acids in a sequence to be coded 

oeotid I ^ ° f *» *■ -king is based on a comparison of the masses of 

Jnt f T T SCqUenCeS ^ Pf0Ximity t0 ^ ^ *™ —■ — - -eh the 

stnngs of codons, w.th masses of peptides of the protein other than the query peptides 

In a particular embodiment of a method of the invention the ranking step comprises the following: 
(a) calculatmg the masses of peptides translated from sequences in proximity to the regions 

in the known nucleic acids that match the strings of codons- 
00 co mparmg the masses calculated in (a) with masses of peptides of the protein other than 

the query peptides, or fragments thereof, to identify peptides with matching masses- 
(O assigning scores to each matching mass and accumulating the scores for all maiching 
masses in proximity to the regions in the known nucleic acids that match the strings of 
codons; and 5 

(d) optionally ranking two or more known nucleic acids that match the strings of codons 
based on the accumulated scores to identify potential nucleic acids encoding me protein 
to thereby identify the protein. 

m an embodiment, the masses calculated in (a) are compared with masses identified by mass 
spectrometry for peptides of the protein other than the query peptide, m particular, the masL T 
comparedwithmassesidentif,edina P recursorionscan(PIS). 

potential 1116 ^ ^ ^ ™<™*«™ -corning the 

potential nucle.c ac.ds encoding the protein. Such additional step may involve finding canonical splice 

anant masses that can be mrther compared with a P!S mass list to identify splice ovLp peptide!^ 
help solve the gene structure of detected proteins. 

In aspects of methods of the invention, the query peptides are at least 4 or 5 amino acids in length 
In another aspect of the invention, at least two query peptides are translated 
A method and/or system of the invention may generally comprise the following features- 
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(a) 



A method of locating potential coding genes within a genome. A database search engine 
is provided that is capable of locating query DNA strands within a genome. 



(b) 



A method of translating genes to find the masses of tryptic peptides they generate. Once 
potential genes have been located, they are translated and digested in silico (by 
computation) to obtain the masses of the tryptic peptides. 



(c) 



A method of comparing calculated tryptic peptide masses with masses detected by a first 
mass spectrometer. The tryptic peptides generated from each gene are compared with the 
precursor ion scan (PIS) list of masses. A scoring algorithm ranks every matching mass 
and thus a score for each gene match is generated to help the user to quickly identify the 
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true coding gene. 

(d) Fast overall processing time. Proteins should be identified in the time that a second mass 

spectrometer generates a sequence which on average is between 0.5 to 1 second. 
The methods of the invention are generally executed in a computer apparatus/system. 
In an embodiment, a computer implemented system is provided for identifying a protein through 
amino acid sequences of one or more query peptides generated from the protein comprising: 

(a) a search engine for locating regions of known nucleic acid sequences that match strings 
of codons translated from one or more query peptides; 

(b) a mass calculator for calculating masses of peptides translated from sequences in 
proximity to regions in known nucleic acid sequences that match the strings of codons; 
and 

(c) optionally a scoring unit for (i) comparing masses calculated in (b) with masses of 
peptides of the protein other than the query peptides to identify peptides with matching 
masses; (ii) assigning scores to peptides with matching masses; and (iii) accumulating 
scores for all matching masses in proximity to or around the regions located in (a) to 
evaluate the likelihood that a region is a true coding region for the protein. 

The invention further relates to a programmable hardware employing a method of the invention. 
In particular, a method of the invention may be implemented using a hardware acceleration system. 

In an aspect the invention provides a hardware acceleration system for identification of a protein 
comprising a generic circuit board capable of being plugged into a computing device wherein the circuit 
board comprises logic chips and memory wherein the memory comprises nucleic acid sequence 
information, and the chips provide means to search through the nucleic acid sequence information for 
regions matching strings of codons translated from one or more query peptides provided to the computing 
device as input. The query peptide may be provided to the computing device as input from a mass 
spectrometer. 

In an embodiment, a method of the invention is implemented using field programmable gate array 
(FPGA) technology. In another embodiment, a method of the invention is implemented using application- 
specific integrated circuit (ASIC) technology. 

Information on the masses of the query peptides and peptides translated from the region around a 
hit or match nucleic acid sequence generated using a method of the invention and nucleic sequences and 



WO 2004/109579 



PCT/CA2004/0008S3 



-4- 



10 



their scores identified using a method of the invention may be incorporated in or stored on a computer- 
readable medium or database. Thus, the invention provides a database storing data relating to strings of 
codons, matching nucleic acids, masses, scores, or methods of the invention. The invention also provides a 
computer system for storing this information.- 

The invention also provides computerized representations of information generated using a 
method of the invention, including any electronic, magnetic, or electromagnetic storage forms of the data 
needed to define it such that the data will be computer readable for purposes of display and/or 
manipulation. 

The invention also contemplates a computer program product comprising a computer-usable 
medium having computer-readable program code embodied thereon for effecting the steps of a method of 
the invention, in particular identifying matching nucleic acids and identifying the protein within a 
computing system. 

In an aspect the invention provides a computer comprising a machine-readable data storage 
medium comprising a data storage material encoded with machine readable data wherein said data 
15 comprises information generated using a method of the invention. 

The invention also provides a system for managing and identifying proteins and methods for 
presenting information pertaining to nucleic acid sequences that potentially encode a protein. 

Methods, systems, databases, and computer products of the present invention may be used to 
determine information for a protein. They may be used to identify protein sequences that, for example, 
may be associated with disease or that can be used in drug design. In an embodiment, the methods and 
systems of the invention may be used to identify proteins in samples from patients. 

These and other aspects, features, and advantages of the present invention should be apparent to 
those skilled in the art from the following drawings, detailed description, and example. 
DESCRIPTI ON OF THE DRAWINGS AND TABIDS 

The invention will now be described in relation to the drawings in which: 
Figure 1 shows a tryptic digestion of a large peptide. 
Figure 2 shows an outline of an algorithm of the invention. 
Figure 3 shows the architecture of a system of the invention. 
Figure 4 shows search engine amino acid and peptide units. 
30 Figure 5 illustrates locating a query in memory. 

Figure 6 shows parallel comparisons of identical queries to memory. 

Figure 7 shows a schematic diagram of calculator and detection units of a method and apparatus 
of the invention. 

Figure 8 shows a schematic diagram of calculator architecture of the invention. 
35 Figure 9 illustrates complementary strand calculations. 

Figure 10 is a schematic diagram showing a comparison of calculated masses with PIS. 
Figure 11 is an example of a frequency table. 

Figure 12 is a schematic diagram showing architecture of a device of the invention. 
Figure 13 is a schematic diagram showing genome decompression. 
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Figure 14 is a schematic diagram showing query reverse translation. 
Figure 15 is a schematic diagram showing full search engine architecture. 
Figure 16 is a schematic diagram showing a search of the genome. 
Figure 17 is a schematic diagram showing a peptide unit structure. 
5 Figure 18 is a schematic diagram showing a peptide unit operation. 

Figure 19 is a schematic diagram showing pipeline AND operation. 
Figure 20 is a schematic diagram showing a codon unit operation. 
Figure 21 is a schematic diagram showing implementation details of a codon unit. 
Figure 22 is a schematic diagram showing selection of a gene. Hit located in genome. Genes on 
1 0 either side of hit window are translated. 

Figure 23 is a schematic diagram showing translation of a gene to protein using a mass calculation 
process. Gene window translated from DNA to amino acid sequence. 

Figure 24 is a schematic diagram showing digestion of protein and calculation of tryptic peptide 
masses. Tryptic peptides detected in amino acid sequences. Peptide masses calculated. 
15 Figure 25 is a schematic diagram showing calculator architecture. 

Figure 26 is a schematic diagram showing a single stage of calculator. 
Figure 27 is a schematic diagram showing calculator subunits. 
Figure 28 is a schematic diagram showing complementary strand calculation. 
Figure 29 is a schematic diagram showing parallel six-frame calculations. 
20 Figure 30 is a schematic diagram showing scoring unit architecture. 

Figure 31 is a schematic diagram showing data associative mass storage. 
Figure 32 is a schematic diagram showing building the frequency histogram. 
Figure 33 is a schematic diagram showing updating of the histogram. 
Figure 34 is a schematic diagram showing mass matching. 
25 Figure 35 is a schematic diagram showing calculation of the product term. 

Figure 36 is a scaled representation of the distance between two queries. 
Figure 37 is a scaled representation of the distance between two queries. 
Figure 38 is a schematic diagram showing a device partitioned across TM3A 
Description of Embodiments of the Invention 
30 Given a mass spectrometry (MS) spectra of a peptide cleaved from a protein, it is possible to 

generate a corresponding sequence for the peptide (4). Since the peptide was cleaved from a protein, it can 
be assumed that there exists a gene within a genome that codes this protein. If the gene's coding region 
could be located quickly, it could be translated to its amino acid sequence. This longer sequence, obtained 
from the genome, can be compared to other fragments analyzed by mass spectometry as well as intron- 
35 exon splice variants. 

Matching a mass spectrometry derived short peptide sequence to an unannotated genome 
represents an approach that Applicants have found to be well suited for hardware acceleration. Searching 
through unannotated DNA allows peptides to be identified that are missed by gene prediction algorithms as 
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these are appreciable, even in organisms like Saccharomyces cerevisiae, expected to have a complete 
known set of protein coding regions (12). 

As described herein the invention provides methods and systems for identifying a protein through 
amino acid sequences of one or more query peptides generated or cleaved from the protein. The methods 
5 and systems may be particularly useful for identifying proteins isolated from natural sources, patient 
samples, or from libraries that have been prepared synthetically. 

The query peptides employed in the methods and systems of the invention may be generated or 
cleaved from a protein, in particular an unknown protein to be identified, using conventional techniques. In 
an aspect, peptides are generated using enzymatic digestion. In an embodiment, peptides are generated 
10 using proteolytic enzymes such as trypsin which cleaves at K and R residues (except where followed by 
proline). 

The amino acid sequences of peptides generated from a protein may be determined using 
conventional molecular biology and recombinant DNA techniques and mass spectrometry techniques 
within the skill of the art 

15 In an aspect, the amino acid sequences of the peptides are determined using mass spectrometric 

techniques. In an embodiment, amino acid sequences of peptide fragments are determined using a tandem 
mass spectrometer. Examples of such devices include the MDS Sciex Q-Star, the Thermo Finnegan LCQ 
DECA XP, the MDS Sciex Q-TRAP, the Applied Biosystems TOF-TOF, the Waters/Micromass Q-TOF, 
the Bruker Daltonics APEX-Q, and other similar instruments capable of performing MS/MS. By way of 
20 illustation, a tandem mass spectrometer in a first stage performs a precursor ion scan (PIS) on tryptic 
peptides in a protein sample to provide an overview of tryptic fragment masses in the sample. The spectra 
obtained at this stage may be used to generate amino acid sequences for the peptides. In a second stage the 
mass spectrometer selectively filters peptides within a certain range into a chamber where the peptides are 
fragmented through collision with trace gases. In a third stage, the masses of collision-induced fragments 
25 are measured. The spectra obtained can be used to generate amino acid sequences for peptides. 

The query peptides inputted into the methods and systems of the invention may be obtained from 
the spectra produced by the first or third stage of mass spectrometry. In an embodiment, the query peptides 
are obtained from the spectra produced by the third stage of mass spectrometry. 

An amino acid sequence of a peptide is translated to all possible codons from which the peptide 
30 could have been synthesized to prepare strings of codons. There may be multiple codons for each amino 
acid in the peptide. In an embodiment, reverse complements of every query condon string are generated 
and searched against the known sequences. In another embodiment of the invention a computer is utilized 
to translate an amino acid sequence to all possible codons that it could originate from. The information 
may be converted to a form that allows for compression of the strings of codons. In an embodiment, the 
35 information is converted to a 3-bit encoded form that utilizes wildcards. 

Known nucleic acids or sequences, particularly a set or database of known nucleic acids or 
sequences are searched to find regions of known nucleic acids that match strings of codons. Known nucleic 
acids or sequences include nucleic acid sequences from an organism, in particular an organism whose 
entire DNA is sequenced. The whole genomes of many organisms are reported by the NCBI at 
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ww.ncbi.nl^ and in me scientific Hterature ^ ^ embodiment 

the known nucleic acid sequences comprise the human genome or multitude of human genomes In another 
embodiment, the known nucleic acid sequences comprise a set or database or unannotated genomic DNA 
sequences. 

In an aspect of the invention, the search of known nucleic acid sequences may be accomplished 
by aligning multiple copies of the query string of codons with successive positions within known nucleic 
acui sequences. In another aspect, the search may be accomplished by comparing strings of codons to 
known amino acid sequences, reading a new base into the known nucleic acid sequences, and shifting the 
nucleic acid sequences over by one position. 

If multiple hits or matches are located in known nucleic acid sequences each is ranked according 
to rts likelihood of being the true coding region. Ranking may be achieved by selecting nucleic acid 
sequences in proximity to or around (in particular, on either side of) a known matched region, translating 
the sequences into peptides and corresponding peptide (e.g. tryptic) fragments, and determining the mass 
of each of the fragments. In an embodiment, gene-sized windows of nucleic acid sequence are selected on 
erther side of a matched region (e.g. 10 Kbases). Masses of peptides are determined sequentially until a 
breakpoint is reached. Breakpoints may be defined as a codon that indicates a proteolytic enzyme cut site 
(e.g. K or R if not followed by P for trypsin) or a STOP codoa 

The calculated masses are compared with masses of peptides of the protein to be identified other 
than the query peptides. The calculated masses may be compared to the masses seen in a precursor ion 
scan (PIS) of the peptides, other than the query peptides, generated or cleaved from the protein to be 
identified. A score is assigned to each match based on a comparison of the masses of adjacent peptides in 
the matched known sequence with the adjacent sequences in the unknown protein. The match scoring 
system can incorporate both the frequency of occurrence of individual peptides and the number of matches 
in the final score. The match scoring system can incorporate both the frequency of occurrence of individual 
peptides and the number of matches in the final score. Matching masses can be determined within a 
predetermined threshold ( e.g. < IDa). The threshold may be used to identify standard amino acid variants 
(e.g. oxuhzed states or translational modifications). A scoring function may be used to rank matching 
peptides. 

In a computer implemented method, a mass calculator is used to translate all frames 
simultaneously and produce the masses of fragments in parallel with the search of known nucleic acid 
sequences. 

The methods of the invention are preferably executed in a computer apparatus/system. Included in 
a part,cular system of the invention is a processor comprising a mass calculator and scoring functions 
coupled to databases of known nucleic acid sequences, and various input/output devices such as a 
keyboard, mouse, display monitor, printer, and the like. A processor may be of the PC or standalone type 
and have processing capabilities of at least an Intel Pentium I processing chip. Other processors such as a 
mmicomputer, parallel processor, or a networked computer may be suitable. 

In an aspect of the invention a computer implemented system is provided for identifying a protein 
through amino acid sequences of one or more query peptides generated from the protein comprising: 
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(a) a search engine for locating regions of known nucleic acid sequences (e.g. in a database) 
that match strings of codons translated from one or more query peptides; 

(b) a mass calculator for calculating masses of peptides translated from sequences in 
proximity to regions on known nucleic acid sequences that match the strings of codons; 

5 and 

(c) optionally a scoring unit for (i) comparing masses calculated in (b) with masses of 
peptides of the protein other than the query peptides to identify peptides with matching 
masses; (ii) assigning scores to peptides with matching masses; and (iii) accumulating 
scores for all matching masses in proximity to or around the regions located in (a) to 

10 evaluate the likelihood that a region is a true coding region for the protein. 

In an embodiment, the computer implemented system comprises more than one mass calculator 
with each calculator operating in parallel to produce multiple output masses. Additional mass calculators 
may compute masses of each frame and its complement. In another embodiment, multiple instances of the 
scoring unit are implemented, one for each output of the mass calculator. 
15 The invention particularly contemplates a hardware accelerator system or programmable 

hardware for executing a method of the invention. In particular, a method of the invention may be 
implemented using a hardware acceleration system. 

In an aspect the invention provides a hardware acceleration system for identification of a protein 
comprising a generic circuit board capable of being plugged into a computing device wherein the circuit 
board comprises logic chips and memory wherein the memory comprises nucleic acid sequence 
information, and the chips provide means to search through the nucleic acid sequence information for 
regions matching strings of codons translated from one or more query peptides provided to the computing 
•device as input. 

In an aspect the invention provides a hardware acceleration system for identification of a protein 
25 comprising a generic circuit board capable of being plugged into a computing device wherein the circuit 
board comprises logic chips and memory wherein the memory comprises nucleic acid sequence 
information, and the chips provide means to search through the nucleic acid sequence information for 
patterns matching a query that has been provided to the computing device as input from a mass 
spectrometer. 

In the systems of the invention the circuit board has access to the host computing device's 
memory with its operation being controlled by the host 

In an aspect, a method of the invention is implemented using field programmable gate array 
(FPGA) technology. In another aspect, a method of the invention is implemented using application-specific 
integrated circuit (ASIC) technology. 

In an embodiment, the invention provides a computer system comprising one or more field 
programmable gate airay (FPGA) logic chips together with memory storage and input and output channels 
which communicate to a computing device, wherein the memory holds nucleic acid sequence information, 
and the FPGA logic chip initiates searches through the nucleic acid sequence information for matching 
strings of codons translated from one or more query peptides provided to the computing device as input. 
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In a particular embodiment, the FPGA Wo nUi^ • 

(a) a search engine for locating regions of known nucleic acid sequences (e . in a dat k * 
that match strings of codons transit «. sequences (e.g. in a database) 
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download designs into the onboard FPGAs and to communicate with the board to provide input and 
receive output. 

Information generated using a method of the invention, including strings of codons derived from 
query peptides and complements thereof; the masses of the query peptides and peptides translated from the 
region around a hit or match nucleic acid sequence, and the identity of the matching known nucleic 
sequences, their scores and ranking, may be incorporated in or stored on a computer-readable medium or 
database. Thus, the invention provides a database storing data relating to strings of codons, matching 
nucleic acids, masses, scores, or methods of the invention. The invention also provides a computer system 
for storing this information. 

In an embodiment, the invention contemplates a database comprising a set of masses 
corresponding to the masses of the query peptides and the peptides translated from a matching region in 
proximity to or around a known nucleic acid generated in a method of the invention. The invention also 
contemplates a database comprising scores assigned to peptides with matching masses, accumulated scores 
for all matching masses, and nucleic acid sequences identified using a method of the invention that 
15 potentially encode a protein to be identified and the scores and rankings for the nucleic acids. 

The invention also provides computerized representations of information generated using a 
method of the invention, including any electronic, magnetic, or electromagnetic storage forms of the data 
needed to define it such that the data will be computer readable for purposes of display and/or 
manipulation. 

In an aspect the invention provides a computer comprising a machine-readable data storage 
medium comprising a data storage material encoded with machine readable data wherein said data 
comprises information generated using a method of the invention. 

The invention also provides a method for presenting information pertaining to nucleic acids that 
potentially encode a protein the method comprising the steps of: (a) providing an interface for entering 
query information generated from mass spectrometry relating to amino acid sequences of peptides 
generated or cleaved from the protein; (b) examining records in a database of known nucleic acid 
sequences to locate regions in the nucleic acid sequences matching strings of codons translated from the 
entered query peptides' amino acid sequence information; (c) displaying the data relating to the matched 
string of codons and regions in the nucleic acids; and (d) optionally displaying the masses of the peptides 
generated from mass spectrometry and the masses of peptides encoding regions in proximity to the regions 
of known nucleic acids that match the string of codons. The method may also comprise displaying scores 
for each matching mass, accumulated scores for all matching masses around or in proximity to the regions, 
and/or the rankings for the nucleic acids based on the accumulated scores. 

The invention also contemplates a computer program product comprising a computer-usable 
medium having computer-readable program code embodied thereon for effecting the steps of a method of 
the invention, in particular, identifying matching nucleic acids and identifying the protein within a 
computing system. 

The invention also provides a system for electronically identifying proteins employing a genome 
of an organism. 
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Methods, systems, databases, and computer products of the present invention may be used to 
determine information for a protein. They may be used to identify protein sequences that, for example 
may be associated with disease or that can be used in drug design. In an embodiment, the methods and 
systems of the invention may be used to identify proteins in samples from patients. 

Having now described the invention, the same will be more readily understood through reference 
to the following examples that are provided by way of illustration, and are not intended to be limiting of 
the present invention. 
EXAMPLE 1 
In-Silieo Search Strategy 

A protein sample can be prepared for mass spectrometric analysis by standard techniques (8) If a 
specific proteolytic enzyme such as trypsin is used the peptide will be cleaved at its K and R residues 
(except where followed by Proline). This process is illustrated in Figure 1. 

These peptide fragments are introduced into the first stage of a tandem mass spectrometer through 
a variety of techniques (9) (10). There are generally three stages of MS/MS operations. In the first stage 
the mass spectrometer performs what is known as the precursor ion scan (PIS). The PIS gives an overview 
of the tryptic fragment masses in the sample. In the next stage, the MS can then act as a filter to selectively 
pass fragments within a certain range into the next chamber. Here the tryptic peptides are allowed to 
fragment through collision with trace gases (e.g. N 2 ). The next chamber is used to accurately measure the 
mass of collision-induced fragments, which are selected individually from the second chamber, usually in 
order of abundance, This last stage can be time consuming if there are many fragments. Mass 
spectrometry measurements consume the sample, so if the sample is small, it may run out before each of 
the fragment masses found in the PIS can be processed all the way to the third stage. This is especially 
true for systems employing small volume liquid separation methods to introduce sample into the 
instrument. 

Using the conventional techniques for analysis (5) the spectrum obtained from this stage can then 
be used to generate a peptide sequence, but will fail to sequence peptides that either do not exist in the 
protein database or those peptides that occur as a result of nucleotide polymorphisms. 

Using the hardware accelerated search system described herein, the individual steps of the MS 
process are not modified in any way. However, following the first sequencing, it may not be necessary to 
process all fragments in the PIS stage using MS/MS techniques. If one can quickly locate the gene of 
ongm m unannotated genomic DNA sequence for the first tryptic peptide fragment that has been 
sequenced by the MS/MS stage, then one can infer the masses of other tryptic peptides arising from the 
same gene, and identify them from the list of masses at the PIS stage directly. This strategy effectively 
reduces the database search size to a gene-sized window setting for pursuing a statistical scoring scheme 
35 using PIS mass information detailed herein. 

To this end, all possible DNA sequences that could have coded this fragment are generated (i e 
the peptide query is reverse translated from amino acids into strings of all possible codons). This is quite 
different from conventional approaches that apply 6-frame translation to the database and search in amino 
acid sequence space. There may be multiple codons for each of the amino acids in the short subsequence 
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of the tryptic fragment as detected, and thus multiple query DNA sequences to search for. Intuitively this 
requires a wildcard query approach. 

It is also likely that there will be several matches in the human genome for the query sequences 
smoe they are relatively short in length and may exist in many protein, To resolve which of these hits 
actually is the unknown protein, a section of the DNA surrounding each of the hits is taken into 
cons.deration. The section will encompass approximately a gene-sized window on either side of the hit 
Tins section is immediately reverse-translated in silico to detennine what coding regions it contain, Since 
the ongmal sample was trypsin digested, the same procedure is applied to the translated sequence (i e it is 
spht into several peptide fragments at its K and R boundaries, excluding KP and RP). The mass of each of 
these smaller fragments is then determined and compared against the list of masses detected by the PIS If 
the two masses match within some specified tolerance, a sequence is assigned, and aggregate statistics can 
be used to determine the likelihood of the correspondence of individual tryptic fragment matches between 
the PIS data and the gene found. 
Algorithm overview 

A peptide query can be obtained from the spectra produced by the third stage of the MS (4) Each 
ammo acd in this query sequence is translated to the possible codons that it could have originated from 
(one such example shown in Figure 2 a). Each of these potential codon strings is provided as a 
simultaneous parallel query to the human genome database. Any locations in the genome which contain 
these coding regions are flagged (Figure 2 b). 

If there are multiple coding regions, the DNA sequence on either side of the hit location is 
considered. A gene-sized window of DNA (10 Kbases in the current implementation) is selected on either 
s.de of the hit and translated into its peptide and corresponding tryptic fragments (Figure 2 c ). The mass of 
each of these fragments is then compared to the list of masses generated by the PIS. If mere are masses mat 
match within some user-defined threshold (usually < 1 Da), aggregate statistics for each match are 
recorded (Figure 2 d). Based on the MOWSE scoring algorithm (6), matching peptides are ranked based on 
then- frequency of occurrence. The score for each hit then corresponds to the likelihood that peptide 
matches are random. 

The basic flow of the algorithm can be summarized as follows: Each possible coding region of a 
query peptide is identified and returned along with score indicating the likelihood that this region is the 
true coding sequence. There are several advantages to this approach. Firstly, the last stage of the MS 
descnbed above may not need to be repeated several times. Furthermore the final step of ordering 
mdmdually sequenced peptide fragments can also be eliminated. Software implementations of similar 
methods tested have no capacity for wildcard expansion and are not fast enough for high-throughput 
protein sequencing as the genome search takes approximately 3.5 minutes per spectrum on a 600MHz 
Pentmm processor, which would be expected to scale to only 52 seconds on an Intel 2.4GHz processor 
commonly available on PGs. 
Hardware FPGA Implementation 

To leverage the advantages of the solution described herein and obtain real-time performance, a 
hardware FPGA implementation of the process outlined herein has been built. Three key components are 
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recced: Primarily, a search engine is needed to locate the possible coding regions of the peptide within 
the genome. A mass calculator is needed to produce the masses of tryptic founts in the gene window 
surrounding all potential match locations. Lastly, a means of evaluating the likelihood that a given location 
m the genome is the true coding region for an unknown protein is required. The scoring unit compares the 
masses generated by the calculator to those found in the PIS, and ranks hit locations based on the quality of 
the match. J 

Implementation 

, ™ T™\ deSCribe<1 herCin bCen Pr0t ° tyPed ° n * e ° f T0ront0 ' s Tnmsmogrifier 

IZTl P ^ ™ 3 " 8 Pr ° t0typinS b03rd With f0Ur ^connected Xilinx Virtex 

2000B FPGAs, onboard RAM and various IO connectors to allow the addition of peripheral devices It 
also has a software interface that allows it to communicate with a host PC. It allows for search speeds of 
700 MB/s or approximately 1.9 Gbases/s. 

To implement the above algorithm, the onboard RAM is loaded with the genome, and the FPGAs 
are loaded with the search engine, mass calculator and scoring unit. The device is initialized by sending in 
the peptide query followed by the list of masses detected in the PIS. All locations in the genome which 
could possibly have coded the query peptide were first searched and then the masses of the surrounding 
trypfcc fragments were calculated. If a significant number of matches are found between the calculated 
fragment m asses and the PIS, a likely coding region has been found. The host PC then receives a list of all 
locations at which the query was found along with the score indicating the quality of each of these 
zv matches. The general flow is depicted in Figure 3. 
Human Genome Database 

The genomic database sequence is loaded into the onboard RAM on the TM3 when the device is 
•nmalized. It is stored in a 3-bit encoding which allows for eight possible characters, five of which are use4 
(A = 000, T = 001 ,C = 010.G = 100 and N =100 for ambiguities). Substantially more RAM may be used 
on board to store the entire human genome. The encoded database is obtained from a FASTA file which 
1S translated from its text form into the 3 bit version in software. When the device is initialized, the 
encoded database is loaded into the off-chip RAM surrounding the FPGA. 
Search Engine 

As described herein, the primary objective of the algorithm is to identify all possible locations in 
the genome from which a peptide may have originated. To accomplish this, the user provides a peptide 
query (inferred from the spectra generated by the last stage of the MS), which is simply a string of amino 
acute. Each amino acid in the string is translated (in software) to the codons from which it may have been 
synthes,zed with optional wildcard, These strings of codons are converted to the 3-bit encoding described 
herem and sent to the search engine. Thus the query entering the search engine is no longer in amino acid 
form, but rather a set of all possible DNA strand, The search engine will report the locations within the 
genome in which a query string is found. 

One of the advantages of the 3 bit encoding is that it allows for compression of the query string. 
Consider for example the amino acid query : Pro-Arg-Ser-Al, There are six possible codons for Arg and 
Ser (F.gure 4). They can be compressed to two unique codons and one codon with a wildcard on the 



25 



30 



35 



WO 2004/109579 



PCT/CA2004/000853 



14- 



10 



15 



20 



25 



30 



35 



wobble-base. Thus, eaeh amino acid can be encoded into three codon registers or less. Note that Figure 4 
unplies that there is a hierarchy of units within the search engine. At the lowest level, there is an amino 
acm unit, which accepts potential query codons (from the MS) and memory codons (from the genome) as 
inputs. If any of the query codons match a memory codon the amino acid unit indicates a hit Only a single 
comparison is needed to test all potential codons in a single unit against a single memory codon 

The next level of hierarchy is the peptide unit, which consists of several (10 in this 
•mplementation) amino acid units. If all of the amino acid units in a peptide unit indicate a match, a 
memory string corresponding to the query has been found. This is apparent from the structure shown 
above, as it implies that a sequence of memory can be grouped into codons that translate into the query 
ammo acid sequence. For the example above, if the memory string CCC AGO TCA GCA was read in from 
memory it would produce a match with the peptide unit shown above. 

Once initialized, the search engine reads in the genome from the RAM and starts comparing it to 
the query strings as shown above. One approach is to compare a memory string to the query strings and 
then read a new base into the memory string and slide it over by one position. In this manner, the search 
engme moves through the entire genome database and compares it against all possible query strings 
Conner the example shown in Figure 5. A match to the query string clearly exists in the database string 
However, to discover this match, the memory string must be shifted by nine bases. In the naive 
nnplementation described above, this would be accomplished by multiple serial comparisons as nine bases 
are shtfted in. A better implementation would have multiple copies of the query aligned with successive 
posmons within the memory string. In Figure 5, if there were 10 copies of the query, the first aligned with 
posmon one (as above), the second with position two and so on, the 10* copy would detect a match with 
the memory string. 

To operate at the full memory bandwidth of the TM3 (1 memory word per cycle) all these 
comparisons have to take place in a single cycle. In the hardware, multiple copies of the query register are 
•mplemented, one for each position in the memory string. A depiction of the query registers aligned against 
the data from memory is provided in Figure 6. All comparisons occur in parallel therefore the query is 
s.multaneously compared to each subsequent character position in the memory string. 

If any of the positions match, a hit to the current genome (memory) address is recorded. Note also 
that the copies of the queries are staggered at one-base intervals instead of one-codon intervals. Due to this 
approach, the three 5 >-3 • reading frames are automatically considered. Note that the DNA sequences in the 
genome are unidirectional. There is only a 5'-3« or 3VS' copy of any given sequence in the genome file To 
account for this, the reverse complement of every query strand is also added as a query to the search 
engme. The complement is also staggered in the manner depicted by Figure 6 which automatically covers 
the three 3>-5' reading frames. All complementary strand reading frame comparisons occur in parallel 
therefore the query is simultaneously compared to each subsequent character position in the memory 
string. J 

With this approach, a single peptide query is converted to all potential coding sequences and their 
reverse complements, and the search engine will find any locations that contain these strands. Each hit 
location must then be evaluated by checking if the MS discovered any tryptic fragments surrounding the 
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hit 

Cutsite Detection and Mass Calculation 

In general FPGA implementations are not efficient in dealing with floating-point arithmetic 
Therefore all calculations carried out are done with 20 bits shifted by a constant factor of 10 2 effectively 
allowing 2 decimals of precision. These can be modified to increase precision with slight area and speed 
penalties on the current hardware. 

Once the search engine has located a match to the query, the significance of the match must be 
determined. As mentioned earlier, there may be multiple matches to a short peptide query and it remains to 
determme which of these matches is the true coding gene. To this end the masses of tryptic fragments are 
calculated around every match. If a certain hit location has several neighboring tryptic fragments that 
correspond to the masses found in the PIS, it is likely that this hit location codes the protein in the sample 

. To obtain the mass of fragments surrounding the hit, the genome, is passed through a shift 
register, which acts as a buffer. The shift register delays the RAM words and keeps them in the device. 
. When a hit is detected, the calculator begins accepting words from this buffer; if the buffer is the size of a 
gene, calculations effectively begin at one gene window size preceding the hit location and end 
calculations after one gene window size following the hit location. In an implementation the size of a gene 
is assumed to be 10K bases and therefore tryptic masses are calculated for a 20K base "window" around 
the hit location. 

The calculation of tryptic masses is straightforward. Each codon in the genome is translated to its 
corresponding amino-acid mass. These masses are accumulated sequentially until a breakpoint is reached. 
The breakpoint can be any codon that indicates a tryptic cut site (K or R if not followed by P) or a STOP . 
codon. Once a breakpoint is encountered, the accumulated mass, corresponding to a single tryptic . 
fragment, is forwarded to the scoring unit for comparison with the PIS list. Once again, in a naive 
implementation, each tryptic fragment would be sequentially analyzed and its mass would then be scored 
However the device is pipelined to match the throughput of the search engine (1 memory word/ cycle). As 
a result, the calculator consists of several processing units that operate in parallel. In a 63-bit memory word 
there are 21 bases or 7 codons. Correspondingly the calculator has a 7-stage pipeline to calculate the 
masses of the seven codons in parallel. ' 

The first stage will buffer the first 63-bit memory word, but only calculate the mass of the amino 
acid created by the first codon in the current memory word. It will also determine if the codon indicates a 
cut-site. In the next cycle, the first stage will receive a new 63-bit word as its input and will pass the mass 
and cut-site information to the second stage, along with all the remaining codons in the first word The 
second stage will then add the mass of the received codon to the mass of the second codon in the first 
memory word, which it calculates. This accumulated mass will then be passed to the next stage along with 
the cut-site information and the remaining codons. The process is repeated for each stage and the masses of 
several tryptic fragments are calculated in parallel. 

At each stage there is a calculator unit that receives the masses of the previous codon and the 
current codon. It also receives information about whether a tryptic cut site or cleavage point was detected 
in the previous stage. These data allow the current stage to calculate new masses and determine whether it 
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The w,l card, represented by the V or -biguity character, which may exist in the genome. If a cut site 
. detected the current mass is saved. If a wildcard in the codon creates an irresolvable amino acid the 
mass is d.scarded. The calculator and detection units are depicted in Figure 7. 

Each stage of the calculator has a calculator unit and a detection unit With the aid of a central 
controller each unit outputs masses to be saved and discards masses that cannot be resolved. In every cycle 
a new memory word is read in and its first codon is processed in the first stage. In the next cycle a new 
memory word , read in and the remainder of the old word is passed to the next stage where its next codon 
» processed as described. The overall architecture of the calculator is illustrated in Figure 8 

As with the search engine, the complementary DNA strand must be accounted for. The tryptic 
masses for both the strand stored in the genome and the reverse complement that it implies muTbe 
ca cu ate . W.th the hardware above, the masses of fryptic Segments from me original land can be 
calculated. For the complement, a copy of this hardware is built which transposes and complements the 
codons as reqmred for the complement In Figure 9 an example string is shown alongside its reverse 
complement Note that to obtain the reverse complement the original strand is transposed and the bases are 
replaced wrth their complements. However, the codons arriving from memory arrive in the order of the 
ongmal strand. As shown in Figure 9, the codons are accumulated in the forward direction for the original 
sfrand, but backwards for the complementary strand. This obviously has no effect on the accumulation of 
tryptic masses, which is an associative operation. 

Bach calculator unit computes the masses of one strand and its complement. This accounts for one 
frame and its complement. To account for the other two frames and their complements, two more 
calculator umts are instantiated; each starts at one base position ahead of its predecessor. This is depicted 
» ttgure 3 as three calculators operating in parallel. All masses are calculated with 20-bit precision and 
the stored values for each amino acid mass are accurate to within l/lOO* Da. 
25 Scoring unit 

When multiple hits are discovered for a query, each hit can be optionally ranked relative to the 
others through the addition of the scoring unit to the searching system. To do this the masses of tryptic 
fragments around each hit are compared to those detected in the PIS. If the tryptic fragments around a 
grven hit match those detected by the PIS, it is very likely that the hit corresponds to the true coding 
30 sequence. 6 

The scoring unit is used to provide a ranking of the gene windows. If multiple hits (windows) are 
detected, only a few of them may be the true coding region for the sample in the MS. The score can be 
used to quickly evaluate which window is the most likely coding region. 

There are two stages to the scoring algorithm. Firstly, a calculated mass must be compared to 
masses detected by the PIS. Once the closest match in the PIS is found, the difference between the two 
masses is computed. If this difference is within a user-defined threshold, a match is indicated These 
thresholds can also be used to consider standard amino acid mass variants such as oxidized states or 
translate! modification, The second step is to assign a score to each of the matching masse, The score 
.s used to evaluate the likelihood that a given match is not random. Scores are generated using techniques 
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similar to those used by MOWSB (6) and rely on the assumption that for a true match, a statistically 
improbable number of matches are observed within a gene sized window to masses accumulated in the 
PIS. 

5 bam ^J****** 1 ^ MS maSS6S S6nt t0 SC ° ring Unit > Which -es them in on-chip 

T 7Z ll ^ fr ° m *° Cal ° U,at0r ^ 8enerated ' ^ ^ be wi * *• ^red masses 

from the PIS. This process corresponds to the first step described above. 

A data-associative indexing scheme is used to facilitate rapid comparison, The on-chip RAM can 
essenually be thought of as a set of mass bin, Masses felling into a certain range are stored in the bin 
correspondmg to that range. Consider for example RAM of depth 2048 - there are 2048 unique storage 
10 locations (mass bins/mass ranges) available. Masses can range between 0 and 10485.75 Da, since they are 
20 bit values (2 = 1048576) and all floating point numbers are treated as integers shifted by two decimal 
places. In the data associative storage scheme, the 11 ( 2" = 2048) most significant bits (bits 19 to 9) of 
the mass are used as an address at which to store the mas, Note however, that in a 20 bit mass this implies 
ft* masses to be stored must be greater than 5.12 Da apart since there are 9 non-address bits (bits 8 to 0) 
(2 - 512). This is a constraint on the values in the PIS. It can be modified by adding more storage (i e 
more *an 2048 locations) but this will result in greater area usage on chip. Mass fragments generated by 
the calculator are then used as addresses to retrieve their closest matching PIS value, The difference 
between the calculated mass and the stored PIS mass is then calculated. If it meets a user-defined threshold 
(between 0 and 1 Da), the current calculated mass is flagged as a match. 
20 For this matching mass a score must then be calculated. This is done using a technique similar to 

that used to calculate the MOWSB factor matrix M. The M matrix has as its elements 

fu 

m U ~ FT 1 where/is an element of the frequency factor matrix F 
I J <J(max)l 

The frequency factor matrix is a histogram of frequencies spanning the observed peptide mass 
range over the gene-sized window. The MOWSB factor matrix M then, is simply a normalized 
25 representation of these value, The frequency factor matrix F has columns ttat represent intervals of intact 
protem mass. More importantly, each individual column has several rows which represent 100 Da intervals 
» pept.de mas, As peptide masses are generated, the appropriate row is incremented to keep track of how 
frequently masses fall within a certain range. When matching masses are found, a score is generated for 
each entry based on the formula : 

30 Score- 50000 



Where Mprot is the molecular weight of the protein in the traditional MOWSB search. Since the 
.mplementation does not utilize intact proteins the following representation is used as the score for a 
window. 

SCOre = (tl m ij) WhCre K ^ a SCaHnS faCt0F be SCt by *" USen 
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md.vidual gene windows and then calculates the score using the formula above. 



Note also : 



J max J max /max 

_ n/ . . „ 

if \m wherew is the number of matches. 
\Jmax / 



10 



15 



20 



25 



30 



The evaluator consists of a frequency table with 128 sets of 82 Da bins (Figure 11) These 
represent the rows of the F matrix. Each new mass that is computed is passed through l^l^Z 

r 1 rir l ,r::r ? t ms - n ° k - - * - - ™ £z - 

tall into the 20-bit range. The allowable masses are between 0 Da and 10485 75 Da Th. 1 <>« k- 

bins into 81.9 Da range, Once all the masses in a window have been considered the frequency table Z 
have a count of how many tryptic fragments fall into each different range 

table in 1° T ^ "* CalCUlated ^ 0nCe ^ *«* *• frequency 

table, n tins pass the table already has the frequency with which a mass in this range was detected in I 

current gene window. This frequency is passed to a logarimm unit which calculates loj^ ^ of 
current mass , The use rf ^ ^ ^ ^ £ZZT- 

Zwa bTr d 7 P,eXity r — 48 " hardWare ™" are c^t; ! 

hardware by lookup tables since only the logs of integer values over a relatively small finite range 2 

requuedTheselogarimmscanmenbeaddedtogethertoobtaintheproductterm. 

- m m 

n//=5>g(/,) 

PC to 7 teim ' al ° nS ^ 1116 maXimUm lfreqUCnCy Md nUmber ° f -"Che* is returned to the 

PC to calculate the frequency given by. ° me 

Score = K 
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Results And Conclusions 

10 bit. m b? T^T"' ' °~ »»" " — «- «oo «, oyc.e. B.oh wo,d i. 6, 

. Obases/sec. This „ resides on o„. o f ,hs ton PPO As on the TM3 and osea .ppe.xtoa*,, J. ^ 
total look-up table (LUT) capacity. 

each chip^Zf^ t T" ^ ° CCUPy ^ rCmaining threC ^ ^ - *— - 
ch,p. Each of the frame ch.ps has 99% slice utilization (28K LUTs and 448K RAM bits). The 

15 congest™ here restricts the routablilty of the circuit and limits the speed to 64MH, The Virtex 2000E 
have 43K LUTs and 614K RAM bits in total. On a larger device such as the Stratix S-80 wiuV79K^JUT^ 

performs^ T. ^ ^ rePreSentS * ** *"* * ***** ^ -ftware'that 

20 i r o SeqU6nCe ^ MS/MS d3ta (e - & LUtCfiSk (4) in «** *> P-ide input to the 

be ntegmted wrth software to present the information for further processing. Modules have been built that 

e^Zle 7 ° Ver,aP P6PtideS ^ S ° lve gene of detected 

25 th , ™^T pledescrites ^ 

the system takes three primary inputs, namely: 

1. A peptide query from the MS, which is a string of 10 amino acids or less, 

2. A genome database, 

3. A list of peptide masses detected by the MS. 

30 The design produces a set of outputs for a given peptide query: 

1. A set of gene locations, which can code the input peptide query 

2. A set of scores for each gene location. The scores rank the genes based on the likelihood 
that they coded the protein in the sample. 

35 - , T W ** fi " ^' " a " ^ *= P=P««« "tttoo- and thon 

35 »„„.«, „, _ loc>tlons ^ ^ r ^ ^ f J - 
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The design is divided into three major subunits: 

A search engine that locates all possible coding strands for a peptide query 
A b-yptic mass calculator that translates all matching genes and produces the masses of 
all the corresponding tryptic peptides from the translated gene 

A scoring unit that compares calculated peptides against those stored in the PIS of the 
and ranks the matching gene locations. 
This architecture is depicted in Figure 12. In the following sections the inputs are described and 
how they are encoded within the system is explained. Each of the units in Figure 12 is desled * 
flow of data through the system is detailed. ^ 
Genome Database Coding and Compression 

The genome database is one of the primary inputs to the system. To better understand the nature 

The genome database is stored as an ASCII fi,e of bases, and is available for download from 
severa, Afferent mstitutions. The ASCII representation uses S bits per character, which a,,"" 

f * ^ - -™ — of the storage space is wasted. This ASCH £ is 

1 1 l !n a vaiue that a,iows fw 27=128 -~ »* — — i of 3 

characters and the characters themselves can be one of five vaiues.. Therefore there are 53=125 unicue 
«-» in me actual genome database. For examp,e AAA - 0000000, AAT = 0000001, AAC = 0000010 
e^Th, encodmg uses 2.3 bits per base wasting on,y 2. 3 o /o of the storage space (125 of 128 possibles 

Since the genomes of most organisms are large (15 million to 3.3. billion characters) it is not 

— r^r- database directiy ° n - cMp - ~ - — — - ™* ~ 

As the genome is read from external RAM into the device, it first passes through the decoder units 

lztt* codon using *• ori * nai 3 - wt enc ° dtag — * — ^ thenz: 

BlockRAM umts that are configured as ROM, They accept the compressed string as an address and 
produce an uncompressed bit-string as their output. 

BV . ^ 7°° m P reSSed bit -^g uses 3 bits per base that allows for eight possible characters five of 
wh 1 chareused(A = 000,T = 001,C = 0^ 

represented by a 9-bit value within the hardware as shown in Figure 13. The rest of the hid™ 
descnbed m the fol.owing sections also use the 3-bit encoding scheme described above 
Peptide Query 

amino ^ T " "t ^ ^ " ^ iS * ™« d ° S « <?* a string of 

ammo ac,ds> Th,s must be converted to an equivalent DNA representation to be compared against a 
genome database. Consider for examplethe case when me MS outputs rae peptide sequ™^ 
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goal of the algorithm is to locate all genes that can create this peptide. 

Therefore each amino acid is translated into the codons that it could have originated from The 
P ept,de query is a string of no more than 10 amino acids (including wildcards, This query size was chosen 
based o„ the average size of the sequencable portion of a tryptic peptide (appro, 10 amino acids) and the 

sequence ° f amin ° adds ( ° ften - *- 7 > - — * «-* - — * 

The wildcarding of searches is allowed by the inclusion of a wildcard character in the query. This 
also serves to compress the query, as some amino acids with multiple codons will not need each codon 
exphctly enumerated (for example the amino acid Alanine (A) in the query above is expressed as GC) 
This reverse Ration is done on the host PC when the peptide query is received from the MS. No more 
than three codons are needed to encode any amino acid when wildcards are employed. Thus each amino 
acd » reversed translated in the peptide to generate a codon, or DNA query that encapsulates al, the 
possxble codmg strands for the peptide query as shown in Figure 14. Each of these DNA/codon queries are 
then encoded using the 3-bit scheme described above. 

Genetic sequences are stored as either original DNA strands or their complements, but never both 
smce this is redundant. In the 3-bit encoding scheme, no information is stored to indicate the type of 
strand. Therefore the complement of every strand in the database is considered to ensure that all possible 
codmg patterns within an organism's genome are examined. For this purpose, the complement of the query 
.also generated. Thus the original peptide query is translated into six binary strings, three for the original 
DNA strand representation and three for its complement. The query, thus encoded, is submitted to the 
search engme, which locates all instances of the coding stands in the genome. 
Search Engine 

The primary objective of the search algorithm is to identify all possible locations in the genome 
from which a peptide may have originated. To accomplish this, me user provides a peptide query (inferred 
from the MS data), which is simply a string of amino acid, To compare these amino acids to a genome 
(DNA) database they must be reverse translated to codon, The search engine takes these strings of codons 
as input, and outputs all positions within the genome that match the string, 

The purpose of implementing the search in hardware is to maximize speed. This speed is 
governed by the frequency with which the memory containing the genome can be clocked through the 
search engine. The parameter MEM.WIDTH is defined to be the width of a memory wdrd that is read into 
the search engine, i.e. the number of bits read into the system in every clock cycle. Thus the total number 
o clock cycles required to search through a genome in memory (with a size defined by 
SIZB_OF_GENOME)isgivenby: 

35 ■ SIZE OP fi HNOMB 

MEM_WIDTH 

Consequently the total time to search through the database is given by: 

Total_Search_Time = SIZE OF OPMniup x l 

MEM.WIDTH SystemJFrequency 
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Note that the total search time must be less than Is for the search engine to be useful in the de- 
novo sequencing method. Furthermore, there may be other applications that require high-speed searches of 
the genome. 

Search Engine Operation 

The search engine accepts queries, which consist of a set of DNA strings and their complements 
and locates every position within the genome that matches any of these strings. The genome, which is 
stored in the RAM, is clocked in as a series of MEM_WIDTH-bit memory words. On every clock cycle 
the controller reads a new memory word into the system. This word is compared to the set of queries 
provmed by the user. If a match is detected, the search engine controller returns the current memory 
address, which the user can then use to locate the coding gene. The VHDL (Very High Speed Integrated 
Crcurt Hardware Description Language) description of the search engine controller is provided in Table 1 
(control.vhd). A depiction of the architecture of this device is provided in Figure 15. 

Once reset, the search engine controller enters initialization state in which the six DNA queries 
are read into the search engine. This is done in two clock cycles: one for the original DNA query, and one 
more for the complementary query. In the example in Figure 16, a simplified view of the architecture is 
presented, in which a single DNA query is performed. Note that the complementary query shown in Figure 
15 .s removed for simplicity, however the search operations performed on both strings are identical The 
controller then moves into the comparison state in which memory words are continuously read into the 
search engine from externa. RAM. With a new word entering the engine in each cycle, every substring 
wnhrn the memory word must be compared to the query in a single cycle. To do this, multiple copies of 
the query are registered in hardware, and each one is simultaneously compared against the memory word 
Note that as many copies of the query are needed as there are bases in the memory word. This is apparent 
in the architecture shown in Figure 16 as each copy of the query is aligned with a successive base in the 
memory word. 

Using the compression scheme of 7 bits per codon, the number of bases in a single memory word 
is parameterized as: 

NUM_BASES_IN_MEMWORD =MEM_WIDTH x 7/3 

Each copy of the query is stored in a peptide unit, and if any peptide units signal a match (as 
query 4 in the example in Figure 16), the controller exits the comparison state and returns the current 
memory address to the user, to be interpreted as a coding region for the query strand. The search engine 
then returns to the comparison state and the process continues until all the memory has been read. 

It is apparent that the peptide units mentioned above are responsible for the core functionality of 
the search engine. To elucidate the details of the design, a description of the peptide unit follows. 
Peptide Comparison Unit 

The search process described above compares several identical copies of the query to a memory 
word to maximize throughput. Each query is stored in an individual peptide unit 

A peptide comparison unit takes two inputs: 

(a) A set of query codons (corresponding to the amino acids in the query); 

(b) A set of 1 0 codons from memory. 
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ompared agamst the query codons. Each unit produces a single match output that signals whether the 
codon from memory matches any of the query codon, If all of these match signals are activated 
-^-ously, a sfring of codons from memory that matches a set of query codons has been found, l^e 
VHDL descnpnon that instantiates the peptide comparison unit is presented in Table l(protein vhd) 

h> Figure 18 a simplified peptide comparison unit is depicted in operation. There are 3 sets of 
query codons which are spared to the codons from memory, m Figure 18 the matching codons are 
h.ghhghted. If at least one codon from each set showsamatch to memory, the query has been found in th 
genome, or equivalently, a coding strand for the peptide query has been found. 

Thus each of the codon sets signals a pipelined logical AND unit, and if all sets indicate a match 
*e pept.de umt signals a match. A wide AND operation (logical AND with many inputs) will incur 
secant delay ,f rt is to be completed in a single cycle. To avoid this delay and ensure fast circuit 
operatmn, the match registers signals from the units, then AND them as a pipelined operation 

F lg ure 19 contrasts a simple wide AND implementation with the pipelined version described 
above In the non-pipelined unit, there is a comparatively long logic delay as the input pass through 
multiple gates to produce the output AND signal. If this delay is sufficiently high, it will constrain the 
ma™ dock frequency of the circuit. In the pipelined implementation, the inputs are divided into two 
groups. Each of these groups is individual ANDed in a sing.e clock cycle. The results of this operation 
are stored m intermediate registers and ANDed together in the next clock cycle. This technique reduces the 
de ay through logic and allows faster circuit operation. Note that the output of the pipelined AND is 
delayed by an additional clock cycle, but this is usually acceptable as the clock frequencies are sufficiently 
high, and the penalty of an extra cycle is negligible. 

Figure 17 depicts the peptide unit as a set of codon units, as described above. It is the match 
signals from each of these codon units that are ANDed together to verify that al, codons have detected a 
match m memory. These codon units are the building blocks upon which the search engine is built 
Codon Unit 

The smallest fundamental unit of the search is the codon unit, which takes a set of three query codons 
and a angle codon from memory as its input. It produces a match signal as its output If any of the three 
query codons matches the memory codon, the match signal is activated. The set of three codons 
corresponds to the translation defined above. Any amino acid can be represented as set of three codons or 
less. Thus a codon unit essentially determines whether a codon from memory is capable of coding a query 
amino acid. 6 H * 

The operation of the codon unit is shown in Figure 20. Assuming that the query amino acid is 
Argmme (R), rt is translated to its equivalent codons AGA, AGO and CG*. This is done in software before 
the queo, is submitted to the search engine hardware. These three query codons are stored in the codon 
umt, and at every clock cycle.anew base from the genome in memory is read in and compared against the 



Figure 21 illustrates a detailed view of the codon unit The bases in the three query codons 
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divided by position, i.e. the first base in every query codon is ANDed with the first base for a codon from 
memory, the second query base is ANDed with the second memory base and so on. From Figure 21 it is 
apparent that the codon unit only signals a match if each base from memory matches at least one query 
base in its corresponding position. The VHDL code that describes this architecture can be found in Table 1 
(amino, vhd) 

It is the match signal shown in Figure 21 that is passed into the pipelined AND in the peptide 
comparison unit, and ultimately to the controller, which then detects a hit and returns the corresponding 
memory address to the user. 
Interpreting Search Engine Outputs 

The search engine identifies memory addresses that contain a section of DNA capable of 
synthesizing the query peptide. In a biological sense, this corresponds to identifying coding genes within 
the genome. Figure 12 indicates that the gene at the hit location is then sent to the tryptic mass calculator 
for further processing. 

However the stream of DNA from the genome database, which passes through the search engine 
has no markers to indicate the start or end points of a gene. To overcome this lack of information the 
average size of a gene is used to delineate the gene under consideration. 

Defining the size of a gene as GBNE_SIZE bases, a 2 x GENE.SIZE window of bases 
surrounding the hit is sent to the calculator. This approach, as shown in Figure 22, allows the consideration 
of one gene preceding the hit and one gene following it. In practice, this window is implemented as. a 
GENE.SIZE sized shift register. The input data to this shift register is obtained from the output of the 
decoder blocks described herein. This data is in the uncompressed 3-bit form; therefore the depth of the 
shift register is GENE.SIZE x 3 bit, Data from the decoder is continually passed into the gene window 
regxster, which acts like a delay element, as its outputs are delayed by GENE.SIZE (its depth) relative to 
its mput When the search engine detects a hit, the output of the gene window is sent to the tryptic mass 
calculator, which continues to read the gene window until it has processed 2 x GENE.SIZE bases 

This technique ensures that the calculator processes a reasonable amount of genomic data on 
e,ther s.de of the hit location. However, the fixed size of the gene window adds an inherent error to further 
operations, as most genes will be of a different size. Regardless, if a reasonable portion of the gene is 
processed, it will still be possible to identify many of the peptides from the translated protein. 
30 Summary of Search Engine Design and Operation 

The original peptide query is translated from amino acids to sets of codon. These codon strings 
are stored in the codon units that make up a peptide unit. Multiple identical copies of the peptide unit are 
mstantiated to maximize the throughput of the search. The search engine progresses incrementally through 
the address space of the genome stored in RAM, looking for a match to the queries. If a match is found, the 
current memory address is sent to the user as a gene location that codes the peptide query. Genomic data 
surrounding the hit location is then sent to the Tryptic Mass Calculator as illustrated in Figure 12. 
Tryptic Mass Calculation 
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closely matches the protein sample in the MS. 

^on a„ d calculation operations described above fc ^ ^ ^ _ ^ 
*e calculator unit must translate the matching genes from the search engine into amino acids and locate 
10 *e tryp c cut-ates. To obtain tryptic peptide masses, the sum of masses of the amino acids from cut-s^ 
to cut- S1 te * accumulated. These masses are then sent to the Scoring Units as illustrated in Figure 12 

tk TxTT ° f ^ ^ CalCU ' ati0n Pr ° CeSS ' an 6Xample ° f * e ^ fa ^ed is set out below 
The DNA data from the gene window, i.e. the matching genes, are interpreted as a stream of 
codons, or equivalent* as an amino acid string. In effect, the gene is translate* to its corresponding 
15 protein as shown in Figure 23. • wiapnwmg 

0--P-tem is translated, its tryptic peptides must be compared to those detected by the MS. 

T T"" ^ ^ ^ Pr ° tein > ^ — **» *«» *e tryptic cut-sites (Lysine 
(K ami Argmme (R) ammo acids) and calculates the accumulated mass of all amino acids between Le 
cut-sites as illustrated in Figure 24. 
20 Calculator Architecture 

^ ^ rrT 1 view ofthe caicuiator 88 de ^ 

the translation, digestion and peptide mass calculations described above. 

At every clock cycle, the controller for the calculator reads a new set of 

^^~^j^^/~^'^ B ^ 4W ^^ * >aSCS *" >m * 6 8enC W " M * ow " Ito * e ca l c uktor. The calculator opiates 
1 data »n codon-sized units. Note that each stage of the calculator in Figure 25 has a single active 
coon attached toadetection unit and mass lookup table. The first stage of the calculator transit its first 
codon into the mass of its corresponding amino acid, which in turn is passed to a mass accumulator. In the 
next clockcyclethe controller readsanewset of codons fromfhe gene window into the calculator, and * 
— g processed codons from first stage are passed down. In the Tryptic Peptide Masses second 

0 Ca,CUla OT T C ° d0n ^ ^ ParaUel *» firSt «*» ^m the new set The 

accumulator from the first stage passes its calculated mass to the second stage. Thus the mass of the first 
ammo acid can be added to the mass of the second to calculate the mass of the peptide. If the detection 
^^^^ Conine or Lysine amino acids not Miowed by Pro,ine), digestion occurs 
^d the accumulated pept.de is output from the calculator. Each stage of the calcu.ator operates in an 
identical manner by receiving a set of codons, performing emulations on only a single codon and 
uffenng the rest These remaining codons are passed to the next stage in the subset clock cycle 1 
the proc ss .repeated until the entire gene has been processed. The VHDL representation of the behaviour 
of the calculator is given in Table 1 (mod_calc.vhd). 
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The matching gene is passed as input to the calculator, NUM_BASES_IN_MEMWORD at a time 
to match the memory throughput The calculator operates on these bases in codon-sized units; therefore 
NUM_BASES_IN_MEM WORD/3 codons (defined as NUMCODONS) are clocked into the calculator in 
every cycle. To maintain this throughput, the calculator needs at least NUMCODONS stages operating in 
parallel, as there could be at most NUMCODONS peptides in a single memory word However, if a 
peptide spans more than a single memory word, the accumulated mass from the first memory word will 
have to be saved until the tryptio cut-site is detected in one of the following memory words. Thus an extra 
pipeline stage is required to accumulate intra-word peptides, resulting in a total of NUM_CODONS +1 
stages operating in parallel to ensure that the calculator can meet the memory throughput 

For every hit detected by the search engine, the calculator processes a full gene window of bases. 
Thus for every hit, the calculator operates for a total of GENE_SI2E/NUM_BASBS_IN_MEMWORD 
corresponding to one cycle for every memory word in the genome. An additional NUM_CODONS+l 
cycles are required to process the codons that will remain the pipeline of the calculator. The following 
sections provide a detailed description of the architecture of the hardware used to perform the mass 
15 calculations. 

Mass Calculation 

For a detailed account of the operations performed by the calculator, consider Figure 26. 
Each stage of the calculator only processes its active codon, which is fed into a lookup table of 
masses and a set of detection units. The mass lookup table reads the codon and produces the mass of the 
corresponding amino acid effectively translating the codon. The detection unit looks for tryptic cut-sites in 
the codon stream. If no cut-site is detected, the mass of the previous codon is added to the mass of the 
active codon. However, if a cut-site is detected, i.e. the end of a tryptic peptide is reached, the accumulated 
mass is sent to the calculator output instead. Thus the detection units and mass accumulators control the 
digestion and calculation operations of die calculator. 
25 Mass LUTs and Detection Units 

The mass LUTs are implemented as ROM tables which accept a 6-bit codon as input and provide 
a mass value, which is NUM_MASS_BITS bits wide, as output. A codon size of 6 bite implies that only 2 
bits are used to represent each of the 3 bases in contrast to the 3-bit per base scheme described thus far. To 
explain this disparity, consider the binary representation of the codons. With only four real bases A,T,C 
and G, a two bit representation is sufficient to encapsulate all possibilities. The third bit is used to represent 
the wildcard character. Thus every mass is represented by two data bits and a single wildcard bit. As the 
mass lookup table is instantiated in BlockRAM, using a 9-bit input for every codon (3-bits per base) would 
require 2 9 = 512 storage locations of NUM_MASS_BITS size in the BlockRAM. By using only the two 
data bits of a base, a codon can be represented in 6 bits. Such an implementation requires only 2 6 - 64 
storage locations. The controller for the mass calculator uses the wildcard bit in combination with the 
wildcard detector to determine whether there is sufficient information to translate the codon into its amino 
acid mass. 

The cut-site detection unit looks for the presence of a Lysine (K) or Arginine (R) amino acid in 
the codon stream Recall that trypsin cleaves the protein at these amino acids provided that they are not 



20 



30 



35 



WO 2004/109579 



PCT/CA2004/000853 



-27- 



10 



15 



20 



25 



30 



35 



Mowed by Protme. mus to ^ daecUOT ^ l00k , ^ B fte ^ coA)n 
to pmsenee of „, oodo. to. _ „ ^ » 

detobon tmtts ^e . «* oodon „ ^ » d outpul . ^ „„ . ^ or " 

codon was found in the input codon. proline 

n. wibfcard detodot, mm mobs to to p^ „ „ fmsolvabk 
~«* Tbe pmaemc. of. mld card or "N" „ . codon tes not „ w " 

rir;: be r** 4 ,n some ° f ^ ■ is am - *^ - ~~ 

Tho wddc^d _ ^ . 4-bt, tap* bonding «. to te „,„ ^ „ . md 

^vtd- a «. m^t, which ls e 0 „ b „ed ^ fte ^ ^ ^ 

^ZZT M **" to - - *- - — — * • — 

Complementary Strand Calculations 

Aa wU, to ae^b ,„„„,, a. cp,^,,^ DNA «d „„„ be .eootmted to. -a. ^ 
to botb to aored ta fc ^ „ comptaneB ^ * 

batdreare abo™, to of ^ ^ _ fc ^ ^ _ w J 

znrr *■* : t ""^ h - ,w * - . *. ood^. „ 

to 1 1 pTT V " *™ *** "* • ' ^ — of 

™" s for *■ -*-->' — - - 

-M. «. TO T *° C ° n """° ra "' *• 0^i!i "' ,, ! ' n "" 1 " '™ i,OSed - "» - -PlO0«d 

wto to. amplM . Thls oorro^od, to to ,e™raed dbecto. However, to ood^ read 

28 (., Tta «- -d. „ aoo™„, aM in to fonv„d d,™»ion to to ortpjn., ^d <„ road 
from memory), but baokwuds for to oomplomontoy slrand. 

ond of " OTe ' y „ i, " Plie, "* *' «*!-»-y *»d, byptio m„a e^eul.bctt, will besta „ to 
o»d of to protem bfcaa aeoom u ,. tion „ .„ pr0C e„ wtdch „ fc 

which its input codons arrive. 
Six Frame Mass Calculation 

Bach calculator unit computes the ma sses of one strand and its consent. This accounts for one 
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Summary of Tryptic Mass Calculator Operations 

The search engine identifies locations in the genome that can code the query peptide. The genes 
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each gene. These masses are compared with the peptide masses detected by the MS to uniquely identify 
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the true coding gene. 
Scoring unit 
Overview 

From Figure 12 it can be seen that the calculator descrihM W„;„ a 

5 peptides for all genes that coded the peptide query ^7 H ^ 

ic pepnae query. These calculated masses are then comoared ™a ^ 
ma.es detected h y the MS to determine which g ene actually codes the protein in die I"e « 
elaborates the representation of the scoring unit shown in Figure 12. The VHDL descriot.cn rl 
in Table 1 (scorer.vhd) ' description of this unit is 

The inputs to the scoring unit are the calculated tryptic masses and the PIS list from the MS After 
10 comparing the two sets of masses, the unit produces a score indicating the quality match ThuT Z" 

- serves to ran, each hit (or gene Wmdow) m order of significanl 3^ ^ ^ 
h, l*ood that a given gene window contains the gene that actually codes the protein in the input ^ 
Tne stance is computed using a histogram that records the frequency of occurence of L £ * 
To compute tins score, the hardware operates in three distinct states- True PIS <* u 
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their most significant bits are identical. To avoid this situation, the design is constrained such that 
ADDRBITS must be sufficiently large enough to ensure that data will not be overwritten. Upon device 
initialization, each of the PIS masses from the MS is stored in the on-chip RAM using this technique. 
Histogram Construction 

5 In the second state, the scoring unit initializes a histogram with NUM.BINS bins. As the mass 

calculator operates, its outputs are passed into the scoring unit The histogram records the frequency of 
occurrence of peptides in different mass ranges. To this end, decoders are used to identify which range a 
given mass fells into and a set of counters is used to determine how many masses fall into a given range 

Figure 32 illustrates how the decoders and counters described are used to update the histogram 
10 Table 1 provides the VHDL description of controller that implements this process 
(modJ5:equency_table.vhd). 

The bins in Figure 32 are simply a set of NUM.BINS registers that are NUM_FREQ_BITS bits 
in width. Bach register, or bin, represents a range of mass and contains the number of peptides in the 
current gene window that fall into this range. The counters at the inputs of these registers identify how 
many of the peptides from the calculator fall into a given range. The counter then updates the bin 
appropriately. The calculator is capable of producing NUM_CODONS + 1 masses in a single cycle. Thus 
in every clock cycle, any bin in the histogram can be incremented by a maximum of NUM.CODONS + 1 
peptides. 

As mentioned, binary decoders are used to determine the range into which a calculated mass fells 
The decoder has ^(NUM.BINS) inputs and NUM_BINS outputs. Each output signal of the decoder 
corresponds to one of the NUMBINS bins. Therefore lo a (NUM_BINS) bits of the mass (defined as 
HISTADDR.BITS) are required to determine the range a given mass fells into. There are 
NUM_CODONS + 1 decoders, each corresponding to single output of the calculator. 

An example of a histogram update is presented in Figure 33 for clarity. In this example two 
calculator outputs are shown. While both masses are different, HISTADDRJ3ITS of their most 
significant bits (6 bits in this example) are the same, thus both fell into the same bin (bin 1). Both decoders 
activate the output corresponding to bin 1, and the bin 1 counter correspondingly indicates that the 
histogram should increment the value in bin 1 by 2. Using this approach, the frequency of occurrence for 
each calculated peptide mass can be recorded. Once a full gene window has been processed, the bins are 
passed through a shift register, which identifies the mass range that occurs most frequently. The maximum 
frequency is one of the key components of the score and is returned to the user. The entire histogram 
update process occurs in parallel with the operation of the calculator, but an additional NUM_BINS cycles 
are required to identify the maximum frequency. The next phase uses this histogram to calculate the 
significance of the matching masses as shown in Figure 30. 
35 Score Calculation 

Once the masses from the PIS have been stored and the histogram has been initialized the score 
calculation process begins. This process consists of two operations that occur in parallel: mass matching 
and significance computation. The mass matching operation compares every calculated mass to the PIS 
values saved in the on-chip RAM to identify any matches. The significance computation uses these 
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matching masses to determine the significance of the gene window at a hit location. The two remaining 
components of the final score, namely the number of matches and the product term are calculated by these 
operations. The following sections describe the architecture and operation of the hardware that implements 
these operations. 
5 Mass Matching 

Once the histogram has been initialized, the masses from the tryptic peptide calculator are once 
again sent to the scoring unit. In this state however, the masses are not used to update the histogram. 
Instead, the calculated masses are compared with the true PIS masses that were stored earlier to identify 
any matches between the tryptic peptides in the current gene window and those detected by the MS. Figure 

10 34 represents the architecture implemented to perform the mass matching operations. 

The goal of the mass matching hardware is to identify calculated masses that fall within a user 
defined threshold of a value in the true PIS. Given a tryptic peptide mass from the calculator, its closest 
corresponding mass is identified in the true PIS by once again using data associative techniques. To see 
how the closest matches are identified, recall the storage scheme used to save the true PIS. 

15 The on-chip RAM, in which the true PIS masses are stored, is set into a read only mode and 

ADDR_BITS of the most significant bits of the masses from the calculator are used as addresses. Doing so 
retrieves the PIS mass that was stored at the same address, i.e. the retrieved PIS mass falls into the same 
range as the calculated mass. 

The difference between the calculated mass and the stored PIS mass is then calculated. This 

20 difference is passed to a comparator along with a user-defined threshold. If the difference is less than or 
equal to the threshold, the comparator signals a match as illustrated in Figure 34. The match signal is 
passed to the controller, which increments a counter to keep track of the total number of matches found in 
a window. This is one of the key components of the final score for the current gene window. 

The matching masses identified here are used in the significance calculation step where the final 

25 component of the score, namely the product term, is computed. This process is detailed in the following 
section. 

Significance Calculation for Matching Masses 

In addition to the number of matches, the scoring algorithm ranks the matches by significance. 
Figure 30. shows that the significance calculator receives frequency values from the histogram in addition 

30 to the matching mass values. The purpose of the significance calculator then, is to determine the ranges 
into which matching masses fall, and compute the product of the frequencies of these ranges. This 
corresponds to the product term. 

The peptide mass calculator can produce a maximum of NUM_CODONS+l matching masses 
(i.e. every output of the calculator matches a mass value in the true PIS). To account for this event, the 

35 most significant HIST_ADDR_BITS bits of matching masses are used to identify the range the mass falls 
into. The frequency of this range is read from the appropriate bin of the histogram and placed in a pipeline 
as shown in Figure 35. As with the tryptic mass calculator, the pipeline is used to ensure that the product of 
the frequencies of multiple matching masses can be computed per cycle to meet the throughput of the 
calculator. Each of the NUM_CODONS+l stages of the pipeline processes a single frequency value per 
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cycle. In the subsequent cycle, the unprocessed frequencies from every stage are passed to the following 
stage. However, the processing units depicted in Figure 35 do not directly compute the product of the 
frequencies. 

To calculate the product of the frequencies in the pipeline, the technique of logarithmic addition is 
employed as represented by the log and accumulator blocks in Figure 35. This method relies on the fact 



that 



M 1=1 



where f m corresponds to the frequency of a matching range and n is the total number of matches. Thus, 
instead of explicitly calculating the product of the frequencies, the sum of the logarithms of these values is 
taken. The actual product can be determined by taking the inverse of the logarithm of the accumulated 
value. This approach is primarily used to ensure that the product term can span a large range. The 
logarithm units are NUM_FREQ_BITS bits wide allowing for values between 0 to 2 NUM - FRB «- BIT s to be 
represented. These values are calculated in hardware by lookup tables, which take a NUM_FREQ_BITS 
sized frequency value as input and produce log 10 (frequency) as its output Since the frequencies themselves 
15 are integer values from 0 to J*"™, this simple scheme is sufficient to calculate the logarithms. The 
sum of these logarithms is computed by a set of accumulators to obtain the logarithm of the product term. 
This value is returned to the user, where the logarithm is inverted to obtain the final product term. This 
product term, along with the maximum frequency and the total number of matches between the 
hypothetical PIS and the MS detected values, is returned to the user to calculate the final score given by. 

Score = * 



product (arm 

(maximum ^frequency jf"* -«**!** -«f . 



A small product term indicates a match to an infrequent mass range, which corresponds to a high 
score. In practice, the actual score values produced by this formula vary in orders of magnitude i.e. high 
and low scores are typically several orders of magnitude apart. Therefore it is common for these scoring 
schemes to use 10 log(Score) as the final score value. 
25 Six Frame Score Calculations 

The calculators generate six frames of masses simultaneously. Each of these frames can be treated 
as an independent gene as each encodes a different set of tryptic peptides. Thus six corresponding scoring 
units, are instantiated in the hardware, each of which computes the score of an individual frame of the gene 
under consideration. Therefore each hit in the database is returned to the user with 6 sets of scoring 
information. Since only one of these six frames is the true coding region, the frame that generates the 
maximum final score for a given gene window is considered to be the true coding frame. 
Design Summary 

Figure 12 illustrates an overview of the key subunits of the device. 

1. A search engine that accepts a peptide query from the MS and locates all coding regions 
35 of the peptide in the genome. 
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2. A tryptic peptide mass calculator that translates and digests the genes around the located 
coding regions to produce the mass of the tryptic peptides that are contained in the 
proteins encoded by these genes. 

3. A scoring unit that accepts the calculated tryptic peptide masses (the hypothetical PIS) 
5 and compares the calculated masses to the true PIS from the MS. The scoring unit 

assigns a score to each set of tryptic masses based on their significance. Each location 
identified by the search engine is associated with its score and returned to the user to 
determine the true coding region. 
The design meets the speed requirements of current MS at a significantly lower cost than an 
10 equivalent algorithm implemented in software. 
EXAMPLE 3 

Implementation Details & Results 
Overview 

A protein identification system described herein performs a reverse translated peptide query 
15 search through a Genome database. It locates all genes that can potentially code the query peptide and 
translates them into proteins. It then uses a variant of the MOWSE algorithm to compare the masses of 
these translated proteins to the masses in the PIS of a tandem mass spectrometer. This technique identifies 
and ranks potential coding regions for a protein or set of proteins in an MS sample. The coding regions can 
be sent to gene finding programs (24) (25) or homology search tools (19) to obtain the protein sequence. 
20 Input Data 

For this study MS data was used from the organism Saccharomyces cerevisiae, commonly known 
as baker's yeast. The yeast genome is an excellent model for the human genome since both are eukaryotes 
and thus share several similar proteins (21). The yeast genome (17) consists of 12070522 bases, which 
defines the parameter SIZE_OF_GENOME as 3.4 megabytes using the compression described herein. For 
25 comparison, the human genome is 918 megabytes. 
Search Engine 

hi the search engine, the most crucial parameters are MEM_WIDTH and 
NUM_BASES_IN_MEMWORD, as they dictate the throughput of the system at a given operating 
frequency. The memory word read from the TM3A is 64 bits wide, but the compression scheme operates 
on multiples of 7 bits; therefore a MEM_WIDTH of 63 bits was used. The compression scheme uses 7 bits 
to encode a codon (or 3 bases) resulting in a NUM_BASES_IN_MEMWORD of 27 bases. 
Gene Window 

After passing through the search engine, the uncompressed memory word enters the gene window 
before it is sent to the calculator. The size of the organism's gene governs the size of the gene window 
upon which the calculator operates. Studies of the genes in yeast have shown the average gene size to be 
approximately 1450 bases (20). The gene window is thus implemented as 18-word 81-bit shift register 
(corresponding to a GENE_SIZE of 1458 bases). In contrast, the average gene size in human chromosome 
7 is 70,000 bases with 10% of the genes as large as 500,000 bases. This expansion in size is due to more 
alternative splicing (55% of chromosome 7 genes are spliced as opposed to 4% in yeast) (28). 
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Mass Calculator 

The bases from the gene window are read and translated by the calculator into peptide masses 
Measurements on the dataset showed that tryptic peptides range in mass from 0 to 10 KDa a 20-bit mass 
value ((220 = 1048576) allows for masses between 0 and 10,485.76 Da. However for an additional level of 
precision, 5 more bits are used to further divide these masses into 0.0003125 Da ranges Thus 
NUM_MASS_BITS is set to 25 bits. 
Scoring Unit 

The masses from the calculator are passed to the scoring unit, which ranks them in a similar 
manner to the MOWSB algorithm. MOWSB defines bins of 100 Da, which were approximated by setting 
NUM.BINS to 128 bins. In the mass range between 0 and 10,485.76 Da, Ms translates to bins of 
approximately 82 Da. The choice of 128 bins in turn defines HIST_ADDR_BITS as 7 bits, as 7 bits of the 
mass are needed to identify 127 bins. 



15 



20 



Parameter 


Values 


Values 




(Yeast) 


(Human) 


MEMJWIDTH 


63 bits 


63 


SIZE_OF_GENOME 


3.4 Megabytes 


917 Megabytes 


NUM_CODONS 


9 codons 


9 codons 


GENB_J5IZE 


1458 bases 


35000 bases 


ADDRJBITS 


9 bits 


9 bits 


NUM_MASS_BITS 


20 bits 


20 bits 


NUM__BASES_IN_MEMWO 
RD 


27 bases 


27 bases 


HIST_ADDRJBITS 


7 bits 


7 bits 


NUM_BINS 


128 bins 


128 bins 


NUMJFREQ_BITS 


8 bits 


8 bits 



The parameter values in Table 3 are chosen for a design with sufficient resources to perform the 
scormg operations accurately. In the following section the implementation details of a device designed 
with these parameter values is presented. 
Implementation Details 

hi this section, the particulars of the design implemented with the values in Table 3 are presented 
Firstly, the functionality of the design when used with MS data is shown. In the subsequent sections 
hardware and software platforms implementing the design at varying levels of performance are considered. 
Finally the costs of these systems are compared in an attempt to identify a practical solution. 
Functionality 
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The following tests were performed to gauge the performance of the system with real MS data. 
The data used were obtained from the study performed in (33). The study utilized Liquid chromatography 
tandem mass spectrometry (LC-MS/MS) analysis using a Finnigan LCQ Deca ion trap mass spectrometer 
fitted with a Nanospray source. Protein identification was performed by the search engines Mascot (22), 
5 Sonar (35), Sequest (36) and PepSea (37). The input sample used in the experiment contains two well- 
characterized proteins from Saccharomyces cerevisiae (baker's yeast): 

1. A Rab Escort Protein (REP) [ACCESSION: NPJH5015] 

2. A heat shock protein from the SSB2 variant of the HSP70 family [ACCESSION: 
NPJH4190] 

10 Rab Escort Protein (REP) 

The REP in the protein sample is from the MRS6 family of proteins created by the MRS6 gene, 
located in yeast chromosome 15. A full gene map is located on the Saccharomyces Genome Database 
(SGD) (18). Its coordinates in the database (i.e. the bases that the gene spans) are. from 1025599 to 
1026956. (located in Chromosome 15 (18) 
15 Heat Shock Protein (HSP701 

The HSP70 family is coded by the SSB1 and SSB2 genes located on chromosomes 4 and 14 
respectively. The sample contains the SSB2 subfamily variant coded by the gene in chromosome 14. 

Each of these chromosomes codes a different subfamily of the HSP70 proteins but both have 
extremely similar sequences (BLAST (19) of the 2 sequences shows 551 out of 613 matching amino acids 
20 (89% identity)). A full gene map is located on the SGD. (located in Chromosome 4 (16), located in 
Chromosome 14 (17)) 

Its coordinates in the database are: 

• from 1427427 to 1429279. SSB1 variant (located in Chromosome 4) 

• from 966 1724 to 9663575. SSB2 variant (located in Chromosome 14) 

25 Table 4 lists the some of the peptides that were provided as queries to the search engine alongside 

the hit locations reported by the search engine. 
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Protein 



REP 



HSP70 



Query Sequences Hit 
(minimal query) 2 



vpealqr 
(vpealq) 



Location^) 



1025938 



saavggptyk 
(saavg) 



nttvptik 
(nttvpt) 



UsdfFdgk 



tgldisddar 
(tgldis) 



fedlnaalfk 
(fedlna) 



1026060 



1428705 
9663002 



1428495 
9662792 



1428190 
9662487 



1428352 
9662648 



10 



Protein 


Query Sequences Hit 


Locations) 


Score 






(Gene) 






nttvptik 


1428705 (SSB1) 


62 






9663002 (SSB2) 


89* 




Hsdffdgk 


1428495 (SSB1) 


65 


HSP70 




9662792 (SSB2) 


89* 




tgldisddar 




^67 






9662487 (SSB2) 


88* 




fedlnaalfk 


1428352 (SSB1) 


66 






9662648 (SSB2) 


88* 



Table 5: Score identifies subfamily variant in HSP70 

lus or greater m length. An average sequence 
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Protein 




REP 



eyvpr 



ilfak 




1026605 
6672335 
2264445 



1938133 
1323971 
5006575 
6224783 
1025581 
5231459 
9309092 
3108258 



79* 
76 
66 



96 
90 
89 ■ 
84 
72* 
71 
70 
61 
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Any short peptide query will match the true gene location and may produce several false 
positives. Thus if several peptide queries are used, the matches will be clustered together (within the true 
coding gene) while the false positives will be randomly distributed throughout the genome. 

This can easily be seen in the data in Table 6. The two true matches are only 1024 bases apart 
which is within the size of a single gene. The next closest match occurs between the hit at 1026605 and" 
1323971, but these locations am 297366 bases apart. It is thus easy to identify the true hits as they are 
clustered together. * 



"Hfak" hit 
locations 


Closest Match in 
"eyvpr" 


Distance to closest 
match 


1025581 


1026605 


1024 


1323971 


1026605 


297366 


1938133 


2264445 


326312 


3108258 


2264445 


843813 


5006575 


6672335 j 


1665760 


5231459 


6672335 


1440876 ] 


6224783 


6672335 


r 447552 


9309092 


6672335 


2636757 



Table 7: Closest Distances between Match Locations 
Table 7 shows the distance between the closest matches using the two peptide queries from Table 
6. Usmg this information, it was deduced that matches that are close to each other indicate the presence of 
peptides being coded by the same gene, which in turn corresponds to the true hit location. Thus, the inverse 
of the difference between match locations is used to identify the true coding gene. 

In Figure 36 a scaled representation of the distance between the two queries is presented. The 
inverse of the distance between matches - which is defined as "closeness" - is presented across all bases in 
the genome in Figure 36. The closeness value is scaled by a factor of 1 X 10 7 for better visualization 

In Figure 36 the true hit can be clearly distinguished from the other matches. Thus by using two 
peptides hits can be identified that cluster around a single gene and thereby discriminate a coding gene 
from random matches. 

The short query peptides in Table 6 are natural, i.e., the peptides occur naturally via trypsin 
digestion. However, similar cases arise if the quality of the sample is poor and only a few amino acids can 
be sequenced. In these cases, the MS may only be able to resolve a short length of full tryptic peptide 
forcing the MS operator to search the database with a shorter query. 

To replicate the effect of these low quality samples searches were carried out using queries that 
are smaller than the minimal query. In effect, substrings of the queries in Table 4 were used to simulate the 
behaviour of "dirty" samples. 

In the following example the two queries "saavggptyk" and "eyvpr" from Table 4 and Table 6 
respectively are considered. To simulate low-quality sequences, the substrings "saav" and "eyvp" of these 
peptide sequences were used. However, the true hits are ranked 65th of 128 hits and 13th out of 48 hits for 
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the queries "saav" and "eyvp" respectively. It is clear that the MOWSE scoring algorithm cannot 
distinguish the true coding locations from false positives. However, using the technique summarized in 
Table 7, the distance between hits can be examined. The 5 closest matches are presented in Table 8. 



"saav" Hit Locations 


Closest Match in "eyvp" 


Distance to Closest Match 


1026060 


1026605 


545 


7486943 


7488841 


1898 


8964661 


8965326 


2305 ~ 


10170118 


10165117 


5001 


9383697 


9378467 


5230 



30 



Table 8: Distance Between Hits in "eyvp" and "saav" 

As before, the inverse of the distance to the closest match - the closeness - between hits produces 
a map of the genome in which the true coding gene is easily identifiable (Figure 37). The true hit can easily 
be distinguished from 127 false positives, even when the query is only four amino acids long. 

The results show that in many cases, the true coding region can be easily identified by using 
multiple queries. With a query of five amino acids, the true coding location was always correctly identified 
using two peptide queries to the database. When using a query length of four amino acids, the number of 
hits per query increases. With more hits, more queries are required to accurately identify the true coding 
region. Using two queries of length four identified the true hit in eight of 12 searches. Of the four 
erroneous cases, the true hit location is ranked 2nd in three of these and 3rd in the remaining case. In each 
of these cases, the distance between hits can be calculated in a few milliseconds, without significant impact 
on the speed of the search and score process. 
Design Implementation on the TM3A 

The TM3 A described herein, was the primary implementation platform for the design. 
Considering the architecture of the TM3A, the device was partitioned across four FPGAs. The design is 
partitioned as shown in Figure 38 and as follows: 

• FPGA 0: Search Engine and Gene Window 

• FPGA 1 : Mass Calculator and Scoring Units (for Frames 1 and 4) 

• FPGA 2: Mass Calculator and Scoring Units (for Frames 2 and 5) 

• FPGA 3: Mass Calculator and Scoring Units (for Frames 3 and 6) 

FPGAs 1, 2 and 3 have identical units implemented on them The distinction lies in the data that 
they receive from the gene window. FPGA1 receives the data from the gene window directly, and 
produces the scores from Frame 1 and its complement (Frame 4). FPGA2 and FPGA3 receive the data 
from the gene window shifted by 1 base and 2 bases respectively, and correspondingly produce the scores 
of Frames 2 and 3 and their complements. Using this structure, the individual FPGAs can be classified by 
the units they implement. Therefore the design will be described in terms of search engine FPGAs and 
calculator and scoring unit FPGAs. 
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Compiling the design with the parameter values described in the previous section resulted in an 
implementation that did not fit on the TM3A due to insufficient resource, The 25-bit mass and 128-bin 
h.stogram force the calculator and scoring units to occupy more area than is available on a Xilinx Virtex 
2000E FPGA. In combination, these units occupy 44338 LUTs and flip-flops, but Table 9 shows that the 



FPGA 


Number of LUTs and 
FFs 


Block RAM 
(bits) 


User IO pins 


Virtex 2000E 


38,400 


655,360 


804 


Virtex II 8000 
~~ Stratix EPI-S20 


93,184 


3,024,000 


1,108 




18,460 


1,669,248 


586 


Stratix BP1-S40 


41,250 


3,423,744 


822 


Stratix EP1-S80 


79^)45 


7,427,520 


1,238 



10 



15 



In an attempt to fit the device on the TM3A, the design was modified to use 18-bit masses with a 
64-bm histogram thus reducing the area occupied by the calculator and scoring units. This modification 
enabled the units to fit on the TM3A, and the speed and area remits for the individual FPGAs are 
presented below. 



Design 
Platform 


LUTs 


FFs 


Memory 
(bits) 


Operating 
Frequency 
(MHz) 


Search Time 
through Human 
Genome(s) 


TM3A- Virtex 
2000E | 


8,622 


1,858 


8,786 


89 


1.4 



Table 10: Total Resources and Speed for Search Engine on Virtex 2000 E 



Design 
Platform 


LUTs 


FFs 


Memory 
(bits) 


Operating 
Frequency 
(MHz) 


Processing Time 
for Human 
Genome(s) 


TM3A - Virtex 
2000E 


27,925 


12,475 


34,816 


58 


2.1 



2000E 

The searching and scoring times shown are for the human genome, and not yeast The ultimate 
goal of these sequencing experiments is to identify human proteins; the search times presented in Table 1 1 
are more relevant when evaluating the practicality of the tool in useful biological experiments. The 
funcuonahty of the device is not dependent upon the organism under consideration; indeed the only 
parameter affected is the value of SIZE_OF_GENOME, which is set to 918 megabytes (approximately 1 
GB) when using the human genome. 
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From the tables above, it is apparent that the calculator and scoring units limit, and thus define, 
the system speed. Table 11 shows that it takes 2.1 seconds to identify and score all gene locations that 
match a single peptide query. This speed however is not achievable on the TM3A due to the limited speed 
of the SRAM. The operating frequencies in Table 10 and Table 11 apply only to the FPGA under 
consideration and are independent of memory speeds. The SRAM on the TM3A operates at a maximum 
frequency of 50 MHz making it the system bottleneck. Taking the memory speed into account, the 
operating frequency of the system is restricted to 50 MHz and the operating time is calculated for a single 
query to be 2.4 seconds. 

In addition to the memory bottleneck, further problems arise as a result of the reduction in 
accuracy mentioned above. Using the less accurate 18-bit mass representation and coarser 64-bin 
histogram severely lower the performance of the scoring algorithm, thus the area and system speed 
presented above are not representative of a practical design. Note that this limitation only applies to the 
calculator and scoring units. The search engine fits on a Virtex 2000B FPGA and is not affected by the 
reduced parameters. Regardless, it is obvious that the TM3A, while a practical prototyping tool, is not 
adequately equipped to maximally implement this design. 

To obtain realistic figures for area and speed, the design was recompiled with the parameters in 
Table 3 to target a set of modern FPGAs with more resources. These results are presented in the following 
section. 

Design Implementation on Modern FPGAs 

A new design implementing modern FPGAs and high-speed commercial memory is described 
below. The FPGAs under consideration are listed in Table 9. The newer FPGAs, namely the Xilinx Virtex 
II 8000 FPGA (31) and the Altera Stratix S40 and S80 FPGAs (32), all have more resources than the 
Virtex 2000E FPGAs on the TM3A. The Stratix S20 is included in Table 10 as it is the smallest FPGA 
upon which a search engine will fit The speed and resource utilization tables are partitioned into 
individual FPGAs. The implementation of the search engine on each of the FPGAs is shown in Table 12. 
Correspondingly the implementation of the calculator and scoring units upon the Virtex II 8000 and Stratix 
S40 and S80 FPGAs is shown in Table 13. Due to the lack of resources on the Stratix S20, the calculator 
and scoring units do not fit on it 
Search Engine 



FPGA 


LUTs 


Flip Flops 


Memory Bits 


Operating 
Frequency 
(MHz) 


Search Time 


Stratix S20 


10,605 


1,694 


7,938 


163 


0.7 


| Stratix S40 


10,605 


1,694 


7,938 


152 


0.8 


Stratix S80 


10,605 


1,694 


7,938 


148 


0.8 



30 



Table 12: Total Resources and Speed for Search Engine using Current Technology 

The reduced operating frequency on the larger devices in Table 12 can be attributed to the fact 
that the smaller devices have shorter wires, which have less capacitance, and are thus faster. 
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bandwidth does not scale with processor speed, but this assumption presents the ideal performance of this 
algorithm in software. A single modern processor currently cannot achieve the 1-second search and score 
time. 

As with the hardware, the algorithm is highly parallelizable and indeed MASCOT is a threaded 
5 program, designed to be implemented in a multiprocessor environment (22). To meet the 1-second 
operation time, the processing time scales were assumed perfectly with cluster size, i.e. to halve the time, 
the cluster size must be doubled. Table 14 shows the number of processors required to achieve 
performance that is comparable to the hardware. 



Number of Processors 


Processing time(s) 


1 


52.5 


32 


1.6 


64 


0.8 



Table 14: Processing Time for Computing Cluster 



10 Table 14 shows that a cluster of 64 processors can achieve the performance delivered by two 

copies of the hardware as described in the previous section. Thus both systems are capable of offering the 
same level of performance. 

In the next section, the system is parameterized based on the resources required to achieve a user- 
defined level of performance. The required resources allow an estimation and comparison of the costs of 

15 the hardware and software systems to evaluate the most cost-effective solution. 
System Cost and Resource Estimation 
Cost of Hardware Platform for Full System 

The most cost effective implementation for a system of the invention is achieved on a set of 4 
FPGAs: one S20 for the search engine and three S40 FPGAs for the 6 frames of calculation and 

20 corresponding scoring units. Such a system requires sufficient RAM and a suitable PCB to act as a 
motherboard. The following is a selected design: 

• . Each set of 4 FPGAs requires a 10.5" x 14" - 14 layer PCB as its motherboard. 

• Every search engine in the system has 2 GB of memory. 

Multiple hardware units can be used to search subsections of the genome in parallel. Clearly a 
25 subsection of the genome will not require the storage space of the full genome. However, small memory 
modules are difficult to acquire commercially, and large memory modules can be purchased relatively 
inexpensively (29). Thus each hardware unit contains a full 2 GB of memory even though this is 
unnecessary for the design. A hardware system that takes under 1 second to search and score using a single 
peptide query, can be implemented for less than half of the acquisition cost of an equivalent software 
30 system. 

The Stratix Power Calculator (34) is a tool that allows a designer to estimate the total power 
consumed by a design on a Stratix FPGA. Using the resource values from Table 12 and Table 13 the 
power consumed by the full hardware system is estimated as 7.6 W (1 W for a Stratix S20 containing 
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search engine and 2.2 W for each of the three Stratix S40 containing the calculator and scoring units). The 
majority of the power is dissipated in the IO pins. All the FPGAs are running at 75 MHz and a 25% toggle 
rate is assumed for every flip flop and memory bit in the design. The custom hardware implementation 
consumes 200 times less power than general-purpose processor cluster. This reduction in total power 
consumption translates into a significantly lower operational cost over the lifetime of the cluster. 
Cost of Hardware Platform for Standalone Search Engine 

The search engine operating as an isolated unit does not require the same number of FPGAs or a 
PCB of the same complexity as the full system. Therefore the following design decisions are made for the 
standalone search engine: 

• A 10"x 4" -8 layer PCB is required as the motherboard and can contain two FPGAs 

• Every search engine in the system has 2 GB of memory 

Using these constraints, the Stratix S20 was found to be the most cost effective FPGA upon which 
to implement the search engine. The hardware searching system costs approximately 40 times less than a 
software platform of comparable performance. 

The power savings are even more significant in the case of a clock speed of 162 MHz and a 25% 
toggle rate for every flip flop and memory bit, with the hardware providing over 2000 times the power to 
performance ratio of a software cluster. These results indicate that there are significant advantages to 
performing genomic searches in hardware. 
Cost Comparison 

This section summarizes the costs of the system, by dividing the solution into two broad 
categories, namely, low-performance and high performance. Here, low performance indicates search times 
in excess of a minute, which may be acceptable in many applications. However, the design must be able to 
identify and rank the coding locations for a peptide query in less than 1 second, thus demanding a high 
performance system. 

For slower searches of the genome, i.e. search times in excess of 1 minute, software is a more cost 
effective solution than hardware. The software cost is based on the quoted price on a 2.4 GHz Dell 
Dimension Desktop (30). The cost of its hardware counterpart is based on the cost of a single hardware 
board capable of implementing the full system. It is possible to design a hardware system using cheaper, 
slower FPGAs but if real time performance is not required, a PC is likely a far more flexible solution with 
a greater capacity for reuse in other applications. Moreover, a PC at half the price of the hardware system 
is clearly a better choice. Therefore, at the low end of the performance spectrum, software is more practical 
vehicle for the searching and scoring process. 

However, using the current cost and performance of the system as a measure of quality, hardware 
is clearly a better solution for an entity seeking the ability to search through genomes in real-time. At the 
high-performance end of the cost spectrum, hardware is more than three times as economical for 
equivalent level of performance. For a standalone search engine, hardware is more than 40 times as 
economical, making it an ideal platform for genomic studies. 

The costs do not take power consumption into account. However, the performance to power ratio 
is far more favourable for hardware, than a cluster of general-puipose processors. Over the operational 
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lifetime of the hardware platform, the power savings will likely translate to a substantial reduction in 
operational cost when compared with software. 

The key resources that determine this cost of a hardware system are: the FPGAs, the RAM and 
the PCB. The FPGA (26), RAM (29) and PCB (27) costs are obtained from current vendor and 
manufacturer quotes. System designers in the future will likely have access to FPGAs with far more 
resources for which prices cannot be accurately predicted. As such the resources required for a given level 
of performance are defined. Knowledge of the required resources will allow selection of the most practical 
platform upon which to build the hardware. 

In general, to design a system that meets a specific level of performance, the required resources 
can be estimated by the three elements listed above: FPGAs, RAM and PCBs. The total cost of the 
hardware is then given by the number of FPGAs (defined as NUM_FPGAs), the total amount of RAM 
(TOTAL_EXT_RAM) and the number of PCBs (NUM_PCBs). This cost is a function of the desired level 
of performance specified by the designer. The performance is specified by the time required to process an 
entire genome, thus the two variables that determine the hardware resources for the system are 
15 size_of_genome (in GB) and searchtime (in seconds). Thus the performance factor: 
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P - 



size, of genome 
search time 



The designer can use the desired value of P to determine the cost of the system in the future. This 
cost is given by: 

COST (P) = (NUM_FPGAs(P) x FPGA.PRICE) + (TOTAL_EXT_RAM(P) x RAM PRICE) + 
(NUM_PCBs(P) x PCB_PRICE) 

An FPGA is classified in terms of its key components, namely the LUTs, flip-flops and memory 
and user IO pins. Given these parameters, it is possible to determine the most cost-effective FPGA or set of 
FPGAs. The total number of LUTs and flip-flops in a given FPGA is defined as FPGA_LUTs_FFs, and the 
total on-chip RAM as FPGA_RAM, and the number of user IO pins as FPGA_IOJPINS. Using these 
parameters, a designer can determine the optimal FPGA for the device. 

The following results are divided into two units: one to provide resource estimates for the full 
search and score system and the other for the search engine as an independent unit 
Resources Required f or Full Search and Score System; 

The values for each of these parameters depend on the performance factor P described above. A 
full implementation of the device from Table 11 requires 12,299 LUTs and flip-flops for the search engine 
and 3 LUTs and flip-flops for the calculator and scoring functions. Thus, with FPGA_LUTs_FFs = 
145313, a I GB genome can be processed in 1.6 seconds. To generalize this it can be stated that: 

FPGA_LUTs_FFs =232500 X P 

Correspondingly, the device in Table 12 requires 7938 on-chip memory bits for the search engine 
and bits for the 3 calculators and the associated scoring functions. Thus 623670 on-chip memory bits are 
required to process the 1 GB genome in 1.6 seconds. This can be stated as: 
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FPGA_RAM - 997872 X P 
The design requires a total of 1014 pins to process the genome as described This enables the 
following definition: 

FPGA_IO_PINS = 1623 X P 
Using these three parameters, the value of NUM_FPGAs can be determined based on the most 
cost effective devices available at the time. To determine the optimal number of FPGAs, the cost and 
resources of a few large FPGAs can be compared with those on many smaller FPGAs. The most 
favourable solution implements the required resources at the minimum cost, thus defining the ideal value 
for NUM_FPGAs. 

The next significant parameter is the amount of external RAM required. A single copy of a 1 GB 
genome can be searched in 1.6 seconds. As the level of parallelism increases and additional copies of the 
device are used to increase the system speed, multiple copies of the genome must be processed. This is 



generalized as: 



TOTAL EXT RAM— — 1— xp 
. 0.625 



Using a design described herein as a reference, it is estimated that four FPGAs and the RAM can 
be connected on a single PCB without prohibitive complexity. This leads to the formula: 

4 

The value of NUM_PCBs clearly hinges on an assumption of 4 FPGAs per board as defined in 
the design. The trend towards larger FPGAs implies that the design will eventually be able to fit on a 
20 single FPGA. 

Each of these formulas is based on the design of the full search and score algorithm that operates 
on a 1 GB genome in 1.6 seconds. The formulas are intended to provide a sense of the required resources 
as the speed, and correspondingly the level of parallelism, within the system increase. If the required 
search time is less than 1.6 seconds, or the size of the genome is significantly less than 1 GB, the 
approximations provided here will be of limited value, as the formulas encapsulate the trend in resource 
requirements for increasing levels of parallelism. 
Resources Req uired for Standalone Search Engine; 

For the standalone search engine, the resource requirements can be defined as a function of 
searchtime and size_of_genome to allow the user to estimate system costs in the'future. The formulas 
given below are based on the data in Table 12 and Table 13 and assume a standalone search engine can 
search a 1 GB genome in 0.8 seconds. 

FPGA_JLUTs_FFs = 9839 ^ P 
FPGA__RAM = 6350xP 
FPGA JOJPINS = 3 13 * P 

35 Once again, the actual value for NUMJFPGAs hinges on the technology available to the designer 

and can be determined based on the cost of available devices. 
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TOTAL EXT RAM - — XV 
~ ~ J .25 

When the design is constrained to two FPGAs per PCB the following formula results: 
NCJM FCB S - 

2 

The caveats from the first set of formulas apply equally well to the approximations above. The 
formulas convey the trends in resource usage based on the search of a I GB genome in 0.8 seconds. 

The formulas above model the resources required for various levels of parallelization, which in 
turn correspond to different levels of performance. As stated the performance is dictated by the time taken 
to process a genome of a given size. Using the resources estimation models above the resources required to 
implement either the full search and score system described or the search engine as an independent unit 
can be estimated. These resources can then be used to determine the cost of the optimal solution based on 
the prices of available devices. 
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The present invention is not to be limited in scope by the specific embodiments described herein, 
since such embodiments are intended as but single illustrations of one aspect of the invention and any 
functionally equivalent embodiments are within the scope of this invention. Indeed, various modifications 
of the invention in addition to those shown and described herein will become apparent to those skilled in 
the art from the foregoing description and accompanying drawings. Such modifications are intended to fall 
within the scope of the appended claims. 

All publications, patents and patent applications referred to herein are incorporated by reference 
in their entirety to the same extent as if each individual publication, patent or patent application was 
specifically and individually indicated to be incorporated by reference in its entirety. All publications, 
patents and patent applications mentioned herein are incorporated herein by reference for the purpose of 
describing and disclosing the cell lines, vectors, methodologies etc. which are reported therein which might 
be used in connection with the invention. Nothing herein is to be construed as an admission that the 
invention is not entitled to antedate such disclosure by virtue of prior invention. 

It must be noted that as used herein and in the appended claims, the singular forms "a", "an", and 
"the" include plural reference unless the context clearly dictates otherwise. Thus, for example, reference to 
"a host cell" includes a plurality of such host cells, reference to the "antibody" is a reference to one or more 
30 antibodies and equivalents thereof known to those skilled in the art, and so forth. 
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Table 1 VHDL Source Code 

1. Search Engine Controller (control.vhd) 

library ieee; 

use ieee.stdJogic_l 164.all; 
use ieee.std_logic_arith.all; 
use ieee.stdJogic__unsigned.all; 



entity control is 
port( 



tm3_clk_v0 

tm3_sram_adsp 

trn3_sram_data 

tm3_sram_addr 

tm3__sram_we 

tm3__sram_oe 

mainjreset 

mem_scanned 

match_address 

codonin 

tm3want . 

sunready 

reset 

mera_for_frame 
freq_enable 
calc_enable 
score sent 



); 

end control; 



: in stdjogic; 
: out stdjogic; 

: inout std_logic_vector(63 downto 0); 
: out stdJogic_vector(18 downto 0); 
: out std_logic_vector(7 downto 0); 
: out stdJogic_vector(l downto 0); 

: in stdjogic; 
: out stdjogic; 

: out stdJogic_vector(18 downto 0); 
: in stdJogic_vector(269 downto 0); 
: out stdjogic; 
: in stdjogic; 
: out stdjogic; 

: out stdJogic_vector(63 downto 0); 
: out stdjogic; 
: out stdjogic; 
: in stdjogic 



architecture ctrljiehv of control is 



component genebuffer 
port( 

clock: IN stdjogic; 

data: IN stdJogic_VECTOR(62 downto 0); 
q: OUT stdJogic_VECTOR(62 downto 0); 
load: IN stdjogic); 
end component; 



component fullprot 
port( 

^ Clk : in stdjogic; 

codonlnp : in stdjogic_vector(0 to 269); 

memwindow : in stdjogic vector(0 to 149); 

foundHit : out stdjogic 

)» 
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end component; 



(rsUoadUoad2,sai 



score,memstatel .done); 



signal memoryjword 
signal dataword 



: stdjogic_vector(0 to 188); 
: stdJogic_vector(63 downto 0); 



signal queryl 
signal query2 



: stdjogic_vector(0 to 269); 
: stdjogic_vector(0 to 269); 



signal stored_data 
signal freq_window_out_buffer 
signal mass_window_out_buffer 
signal memjo^frames 
signal freq_memJo Jrames 
signal mass_mem_to_frames 
signal load_gene_window 
signal load_mass_window 



signal calc_operation 
signal freq_operation 



: stdJogic_vector(62 downto 0): 
: stdJogic_vector(62 downto 0)' 
: std_logic__vector(62 downto 0) 
: stdjogic_vector(0 to 125); 
: stdjogic_vector(0 to 125); 
: stdjogic_vector(0 to 125); 
: stdjogic; 
: stdjogic; 

: stdJogic_vector(8 downto 0); 
: stdJogic_vector(8 downto 0); 



signal testnet 

signal currAddr 
signal codon_ctr 
signal currState 
signal nextState 
signal mainhit 
signal cmplhit 



: stdjogic; 



: stdJogic_vector(18 downto 0); 
: stdjogic_vector(0 to 0); 
: ctrlStates; 
: ctrlStates; 

: stdjogic; 
: stdjogic; 



signal freqL_enableJine : stdjogic; 
signal calc_enablejine : stdjogic; 

attribute synJ>lackJ>ox : boolean; 

attribute synjjlackjwx of genebuffer : component is true; 



reset <= mainjreset; 

freq_genewindow : genebuffer port map ( 

clock => tm3_clk_v0, 
data «> memory_word(0 to 62), 
q => freq_window_out_buffer, 
load => loadLgene^window); 



mass^genewindow : genebuffer port map ( 
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clock => tm3_clk_v0, 
data => freq_window_out_buffer, 
q => mass_window__out_buffer, 
load => Ioad_gene__window); 



proteinblock : fullprot port map ( • 



fpClk^tm3_clk__vO, 

codonlnp =» query 1, 

memwindow => memory word(0 to 149) 

foundhit => mainhit ~ 1 

); 



complmntblock : fullprot port map ( 



rpClk=>trii3_cIk_vO, 

codonlnp => query2, 

memwindow => memory word(0 to 149) 

foundhit => cmplhit ~ 

); 



operation ) 



calc_enablejtfne <= '<)'; 
rreq_enable_Jine <= '0'; 
load_gene_window r 0 r ; 
tm3want <= '0'; 
tm3_sram_we <=* "1111 Zlll"; 
tm3_sram_oe <= "0 1 "; 
tm3__sram_adsp <^ T; 
tm3_sram_addr <= currAddr; 
tm3_sram_data <= (others => 'Z'); 
mem_s canned <= '0 1 ; 
nextState <= rst; 



case(currState) is 

when rst = 



nextState <= loadl; 

when loadl => 

tm3want<=T; 

tm3_ sram__data <= dataword; 

if sunready«Tthen 

nextState <= load2; 



end if; 



nextState <= loadl; 



when load2 =o 

tm3want o= '0'; 



if sunready = '0 1 then 

nextState <« save; 



else 



nextState <« load2; 
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endif; 



when save => 

tm3_sram_addr <= currAddr; 
tm3_sram_adsp <= '0'; 
tm3_sram_pe <o= "Ol"; 

if codon_ctr = M l" then 

nextState <= meminitl; 

else 

nextState <^loadl; 

end if; 



when meminitl ~> 

tm3_sram_addr <=> currAddr; 
tm3__sram_adsp <= '0'; 
tm3_sram_oe <= "01"; 

nextState <= merainit2; 



when meminit2 => 

tm3_sram_addr <*= currAddr; 
tm3_sram_adsp <= , 0'; 

tm3_sram_oe <= "01 "; 
nextState <= meminit3; 



when meminit3 =o 



tm3_sram_addr <= currAddr; 
tm3_srara_adsp <= *0'; 

tm3_srani_oe <= "01"; 

ifscore^sent^Tthen 

load _gene_window <= T; 
nextState <*> memstatel ; 

else 

load_gene_window <= '0'; 
nextState <= meminit3; 

end if; 
when memstatel «> 

if calc_operation > "000000000" then 
calc_enable_line «o= 1 1 *; 

else 

calc_enablejine *0 f ; 

end if; 



WO 2004/109579 



51- 



PCT/CA2004/000853 



if freq_operation > "000000000" then 
freq_enable_line <=T; 

else 

freq_enable_jine <=» '0'; 

end if; 

load _gene_window <= T; 



tm3_sram_addr currAddr; 
tm3_sram_adsp <= '0*; 
tm3_sram__oe <= "01 "; 

••1000000000000000000") then * = T) " (Cmpm = T) ° r < currAddr >= 

nextState <= madematch; 

elsif (score_sent = ' 1 »)then 

nextState <*= memstatel ; 

elsif (score_sent « 'O^then 

nextState <- returnScore; 

end if; 



when madematch => 

if currAddr >= "lOOOOOOOOOQODOOOOOO" then 
mem_scanned <=* '1'; 
nextState <= done; 

else 

nextState memstatel; 

end if; 



when returnScore => 

if scorejsent « T then 

nextState <= madematch; 

else 

nextState <= returnScore; 

end if; 



when done => 

nextState <= done; 



when others => 



end case; 



end process; 

process(tm3_clk_v0,main_reset,codon_ctr,mainhit,cmplhit,calc operation) 
begin — 
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ifmain_reset = Tthen 

currState rst; 



elsif rising^edge(tin3_clkv0) then 

tf -rS 0000 " 811(1 fre <L^on<«000001111«»hen 

memjbr_rrarne <= freq_memjo_frames(0 to 63); 
^J^Z^r 00 " "* ^-<*™«™ < B «»0«>nn- then 
end if mem - for - frame <= mass_mem_to_frames(0 to 63); 



freq^enable <= freq_enable_line; 
calc_enable <» calc_enablejine; 

currState o= nextState; 

case (currState) is 

when rst =o 

codon_ctr<= (others => '0'); 
currAddr <= (others => , 0 t ); 
dataword <= (others => , 0') > ; 
calc_operation <= (others => '0'); 
freq_operation <= (others => '0'); 

when Ioadl => 

dataword (others => 'l 1 ); 

when load2 «=> 



when save => 

codon_ctr codon_ctr+l; 

if codon_ctr = "0" then 

query K=codonin; 

else 

query2<=-codonin; 

end if; 
when meminitl => 



memory^word(0 to 62) tro3_sram_data(63 downto 1); 
currAddr <*= "0000000000000000000"; 



when meminia => 



memory_word(63 to 125 ) <- tm3_sram data(63 downto IV 
currAddr <- "0000000000000000001"; ~ h 
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when raeminit3 => 



calc_pperation <« (others => '0 1 ); 

memory^wordCne to 188 ) <= un3_sram data(63 downto IV 
currAddr "00000000000000000 10"- ~ uow ™o U. 



when memstatel => 
if (mainhit » •!•) or (cmplhit - T) then 



x - / vvuipum — i ) men 

freq_operation <= "000000001 
elsif freq_operation>;000000000" and freq_operation < "0000011110" then 
frecLoperation <*= freq_operation + 1; * i iu then 

elsif fi*eq_operation = "000001 1 1 10" then 
freq operation Mothers => wv 



* — a i iv men 

freqLoperation (others => '0'); 
calc_operation <- "000000001" 

end if; 



if calc.operation > "000000000" and cooperation < "000001 1 1 10" then 

calc_pperation <= calc_operation + 1* 
elsif calc ODeration — ■ "ftnnnni mn« *i 



K.tuuuperanon ^= caic_operatiorj 
elsif calc_operation - "000001 1 1 10" then 

calc_operation <= (others ==> '0'V 
end if; " 



match_address <= currAddr; 

--mero_to_frames(0 to 62) <= mem_toJxame S (63 to 125V 
--mem_to_rrames(63 to 125) <- window out buffer 
freq_memjo^trames(0 to 62) <- rreq^mem To franlesf63 to 125V 
frecumemJo_frames(63 to 125) <= feq.wSdo^^buffe; ^ 

mass„memjo^fraines(0 to 62) <= mass_raem to rrames(63 to 125V 
mass„memj 0 „franies(63 to 125) mass^wnTdoVoutXr^r; 

for i in 0 to 125 loop 

end loop- memory - word W <== memory_word(i+63); 

memory^word(126 to 188 ) trn3_sram data(63 downto 1 )■ 
currAddr <= currAddr +1; ~ v ; ' 



when done => 
when others=> 



end case; 



end if; 

^ t end process; 

end ctrl_behv; 



2. Peptide Comparison Unit (protein.vhd) 

library IEEE; 

use IEEE.STDLOGICM 164.ALL; 
use IEEE.STD_LOGIC_ARTTH.ALL; 
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use DEEE.STD_LOGIC_UNSIGNED.ALL; 



entity protein is 
generic (numAA:integer:=10); 
port ( 

pClk 

potentialCodonsl 

potentialCodons2 

potentialCodons3 

memWord 

onehit 

end protein; 



: in stdjogic; 
: in std_logic_vector(0 to (9*numAA )-l); 
: in stdjogic_vector(0 to (9*numAA )-l 1 
: in stdjogic_vector(0 to (9*numAA )-l 
: in stdJogic_vector( 0 to (9*numAA)-l 
: out stdjogic 

); 



architecture prot_behv of protein is 

signal rowHit : stdJogic_vector(numAA-l downto OV 
signal phitline : stdjogic; 
signal hi: stdjogic; 



component amino 
port( 



); 

end component; 

component big_and 
Port( 



aClk 

codoninl 

codonin2 

codonin3 

memPort 

hit 



: in stdjogic; 
: in stdjogic_vector(0 to 8); 
: in stdjogic_yector(0 to 8); 
: in stdjogic_vector(0 to 8); 
: in stdjogic_vector(0 to 8); 

: out stdjogic 



end component; 



elk : in stdjogic; 
And_in : in stdJogic_vector(l 1 downto 0); 
And_out : out stdjogic); 



begin 

hi<= , l l ; 

rowOEAminos : for i in 0 to numAA-1 generate 
oneAA : amino port map ( 
aCJk=>pClk, 

codoninl => potentialCodonsl ( 9*i to (9*i+8) ), 
codonin2 «> potentiaICodons2( 9*i to (9*i+8) ) t 
codonin3 potentialCodons3( 9*i to (9*i+8) )' 
hit ^rowHit®, 
memPort => memWord(9*i to (9*i+8) ) 



end generate rowOfAminos; 



andaminos : big_and port map ( 
clk=>pClk, 
And Jn(0) => rowHit(0), 
Andjn(l) => rowHit(l), 
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And_in(2) => rowHit(2), 
AndLin(3) => rowHit(3), 
And_in(4) => rowHit(4)» 
Andjn(5) => rowHit(5), 
And_in(6) «> rowHit(6), 
And_in(7) => rowHit(7), 
And_in(8) => rowHit(8), 
And_in(9) rowHit(9) t 
AndJn(lO) => hi, 
And_in(ll)=>hi, 
And_out => phitline 

process(pClk) 
begin 

ifrisingL_edge^CIk)then 

onehit<« phitline; 

end if; 
end process; 



); 



end protjbehv; 



3. Codon Unit (amino.vhd) 

library ieee; 

use ieee.stdjogicj 164.all; 
usework.aU; 



entity amino is 

POrt( cik 

codoninl . m stdJog^clSio 8); 

coS ^stdJogicIvectoKOtog 

-sMJogicvectorCOtos!; 
ui t • m stdJogic_vecto 

y : out stdjogic 

end amino; 

architecture amino__behv of amino is 



signal merahit : stdjogic; 
signal directhit : stdjogic; 



begin 

process( aClk,codoninl, memPort ) 
begin 

if rising_edge(aClk) then 
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(( (codonin2(2) ■ 
( (codonin2(5) = 
( (codonin2(8) = 

(( (codonin3(2) = 
( (codonin3(5) = 
( (codonin3(8) = 



'TormemPortp)* 
Tor memPort(5) = 
Tor memPort(8) « 

TormemPort(2)«= 
Tor memPort(5) = 
Tor memPort(8) = 



hit<=*»l'; 
else 

hit <t» »0»; 
end if; 

end if; 

end process; 



- T ) or ( codonin2(0 to 1) = memPort(0 to 1) ) ) and 
T ) or ( codonin2(3 to 4) - memPort(3 to 



T)or(codonin2(6to7)* 




iemPort(3 to 4) ) ) and 
= memPort(6to7))))or 

- meraPort(0 to 1) ) ) and 
= memPort(3 to 4) ) ) and 
memPort(6to7))))then 



end amino_behv; 



». Tryptic Peptide Mass Calculator Controller (mod_calc.vhd) 



4 

library ieee; 
use ieee.stdJogic_1164.aIl; 
use ieee.std_logic_arith.all; 
use ieee.stdJogic_unsigned.all; 



entity mod_calc is 

generic( num_stages : integer := 10; 

raass_bits : integer := 25 ); 



port( 



); 



elk 

calc_reset 

enable 

ramword 



mass save 
complement, 
complement, 
rdy 



.mass save 



end mod_calc; 
architecture calc_flow of mod_calc is 



: in stdjogic; 
: in stdjogic; 
: in stdjogic; 

: in stdJogic_vector(63 downto 0)- 
: out stdjogic_vector(0 to (num_stages)*(mass bitsMV 
:outstdjogic_vector(lto8); ^ 
:outstd_logic_vector(Oto(num stages)*(mass bitsVIV 
:outstdJogic_vector(lto8); ^ ^ass_oits>i), 

: out stdjogic 
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■ Fragment detection units and mass LUTs 



component masslut 
PORT 
( 



clock 
enable 

q 

); 

end component; 

component cleavecheck 
PORT 
( 

address 

clock 

enable 

q 

); 

end component; 



: IN STDJLOGICVECTOR (5 DOWNTO 0); 
: IN STDLOGIC ; 
: IN STDJLOGIC 

: OUT STD_LOGIC_VECTOR (mass_bits-l DOWNTO 0) 



: IN STD_LOGIC_VECTOR (5 DOWNTO 0); 

: IN STDJLOGIC; 

: IN STDJLOGIC :=T; 

: OUT STDJLOGIC VECTOR (1 DOWNTO 0) 



COMPONENT ambigna IS 
PORT 
( 

address : IN STDJLOGICVECTOR (3 DOWNTO 0V 

clock : IN STD_LOGIC ; 

clken : IN STDJLOGIC ; 

^ q : OUT STDJLOGICVECTOR (0 DOWNTO 0) 

END COMPONENT; 



- Basically the same components; modified to produce values for the complementary strands 



component comply masslut 
PORT 
( 

address 

clock 

enable 

q 

); 

end component; 

component compl_cleavecheck 
PORT 
( 

address 

clock 

enable 

q 

); 

end component; 



IN STD_LOGlC_VECTOR (5 DOWNTO 0); 
IN STDJLOGIC; 
INSTD_LOGIC :=T; 

OUT STDJLOGIC_VECTOR (mass bits-1 DOWNTO 0) 



: IN STD_LOGICVECTOR (5 DOWNTO 0): 
: IN STDJLOGIC ; 
: IN STD_LOGIC 

: OUT STDJLOGIC_VECTOR (1 DOWNTO 0) 
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COMPONENT compl_ambigna IS 
PORT 
( 



address : IN STDJLOGICVECTOR (3 DOWNTO 0V 

clock :INSTDJLOGIC; UU) ' 

clken : in STD_LOGIC ; 

); q : OUT STD_LOGIC_VECTOR (0 DOWNTO 0) 

END COMPONENT; 



signal third_pos_check: stdJogic_vector(l to num stages-1)- 

signal ambig : stdJogic__vector(l to num_stages-lV 

signal word_stage : stdjogic vector(0 to 252)- 

signal discard_buff2 : stdJog"ic_vector(l to num stages)- 

signd mlut_out : stdJogic^vector(((num^stages-l)*mass bits) -1 downto 0V 

•jgod mass_a : stdjo g ic.vector(0 to (num jtages-l^ss bte>™ 

signd massjb : stdjogic_vector(0 to (num^stages.l)*(mass"bits)- IV 

signal discard : stdJogic_vector(l to num_stagesV 

signal discard_buff : stdJogic_vector(l to num stages)- 

signal wordaccum ; stdjogic_vector(0 to (2 *miss bitsMV 

signal accumsave : stdjogic_vector(0 to (2*mass bits)-l)- 

signal mit_ctr : stdJogic_vector(3 downto 0); ~ 

signal massjjreak : stdjogic_vector(l to nwn_stages-l)- 

signal sliamgwindow:stdJogic_vector(0 to (num stages- l)*(mass bits)-l)- 

signal break_in__stage ; stdJogic_vector(l to num "stages)- ^ h 

signal bsJjufT : stdjogic_vector(0 to 2); ~ 

signal followin^break : stdJogicvector(l to num stages)- 

signal save_b:stdJogic_vector(l to num_stages-l) 7 

signal slide_save :stdjogic_vector(0 to num_stages-l)- 

signal fb_buff : stdJogic_vector(l to num stages)- 

signal wildcard : stdJogicjsrector(l to nurS_stages-IV 

signal sd_buff : stdJogic_vector(l to num_stages)- 

signal sdJmfiE : stdJogic_vector(l to num stages)- 

signal startdetected : stdjogic_vector(l to num_stages); 

- Now all the same signals but for the complementary strand 

signal compljhird_j50s_check: stdJogic_vector(l to num stages-1)- 

signal compi_ambig : stdJogic_vector(l to num stages-lV 

signd compLdiscard^buffi : std>gic_vector(llo num stages)- 

signal compl_mlut_out : sMJogic_yector((num stages- UWs bits -1 downtn nv 

S : stdjogic.vec to r<0 to fe^yfif b L T ^ 

sigmd compLmassJb : stdjogic_vector(0 to (nunf staies-1) *mass "bite -IV 

signal compl_discard : stdJogic_vector(l to num stages)- h 

signal compl_discardJ)urT: stdjogic_vector(l tomim stages)- 

signal compLwordaccum : stdjogic_vector(0 to (2*mass bitsMV 

signal compl_accumsave : stdjogic_vector(0 to (2*mass bitsMV 

signal compLinit^ctr : stdJogic_vector(3 downto 0)- " 

signal compLmass^break : stdJogic_vector(l to num stages-1)- 

signd compLshdingwindow : stdjogicvector(0 to (num stages- 1)*32 -IV 

signa complJ>reakJn_stage : stdjogic vector(l to num ItagesV 

signal compl J)s_buff : stdjogic vector(b to 2)- nUm - stages >' 

signal compLfollowingL.break : stdJogicvector(l to num stages)- 

signal compl_save„b:stdJogic,vector(l to num stages-lf 

signd compLslide_save :stdjogicvector(0 to num stages-1)- 

signd complJb>iff: stdJogicvector(l to num stages)- 

signal compLwildcard : stdjogic_vector(l to num" stages-1)- 
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signal compl_sd_buff : stdJogic_vector(l to num_stages); 



signal ml : std_logic_vector(0 to mass_bits-l); 
signal m2 : std_logic_vector(0 to massjbits-l); 
signal m3 : std_logic_vector(0 to massjrits-l); 
signal m4 : std_logic_vector(0 to mass_bits-l); 
signal m5 : stdjogic_vector(0 to massj>its-l); 
signal m6 : std_logic__vector(0 to massjbits-l); 
signal m7 : stdLlogic_yector(0 to massjbits-l); 
signal m8 : std_logic_vector(0 to raass_bits-l); 

signal cml : stdjogic_vector(0 to massjrits-l); 
signal cm2 : std_logic_vector(0 to mass_bits-l); 
signal cm3 : std_jogic_vector(0 to massjrits-l); 
signal cm4 : std_logic_vector(0 to mass_bits-l); 
signal cm5 : std_logic_vector(0 to massj>its-l); 
signal cm6 : std_logic_vector(0 to mass_bits-l); 
signal cm7 : std_logic_vector(0 to massjnts-l); 
signal cm8 : std_logic_vector(0 to mass_bits-l); 



type massStates is (reset,summing); 

attribute ENUMJENCODING : STRING; 

attribute ENUM_ENCODING of massStates : type is "0 1"; 

signal currState : massStates; 

signal nextState : massStates; 



— attribute syn_black_box : boolean; 

attribute syn_black_box of masslut : component is true; 
attribute syn_black_box of cleavecheck: component is true; 
attribute synjblackjbox of ambigna : component is true; 

— attribute syn_black_box of compl_masslut : component is true; 

- attribute syn_blackJ)ox of compl_cleavecheck : component is true; 

- attribute syn_black_box of compl_ambigna : component is true; 

begin 



ml <= mass_b(0 to mass_bits-l); 
m2 <= mass_b(mass_bits to (raassjbits)+mass_bits-l); 
m3 <=■ massj>(2*mass_bits to (2*massJ>its)+mass_bits-l); 
m4 <= mass_b(3*mass_bits to (3*mass_bits)+mass__bits-l); 
m5 <= mas s_b (4*mass_bits to (4*mass_bits)+mass_bits-l); 
m6 <= mass_b(5*massjrits to (5*mass_bits)+massjbits-l); 
m7 <= massjb(6*massjbits to (6*massJbits)+massJ>its-l); 
m8 <= accumsave(massjbits to (mass_bits)+mass_bits-l); 



cml <= compl_mass_b(0 to mass__bits-l); 
cm2 <= compl_mass_b(mass_bits to (mass _bits)+massj>its-l); 
cm3 <= compl_mass_b(2*mass_bits to (2*mass_bits)+mass_bits-l); 
cm4 <= compl_massJb(3*mass_bits to (3*massJ>its)+massJrits-l); 
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cm5 <- compl_mass_b(4*mass_bits to (4*mass bits)+mass bits- IV 
cm6 <= compl_massJ,(5*mas S Jrits to (5 Wsl>its)+mass~bits-l V 
cm7 <- compLmass_b(6*mass^bits to (6 Wsl>its)Wsl>its-lV 
cm8 <= com P Laccumsave(mass^bits to (mass bits)+mass bits-1)- 



mass_save(l to num_stages-l) <= save_b ; 
mass_save(numjjtages) <= slide_save(num_stages-l) ; 

complement_rnass_save(l to num„stages-l) <o= compl save b • 
complement^niass_save(nuin.stages) <« compLslideJaveOl^stages-l) ; 

masses(0 to ((numjitages-l)*(massj3its))-l) <= mas s b* 

masses(((num_stages-l)*(mass bits)) to ((num staees^lWmass hiM^+m« c u * in^ 

(2*massJ>its)-l); u " J - bia S es L > Unassj>its))+massjnts-l) <«accumsave((massj)its) to 

complement^masses(Oto((num_stages-l)*(mass bits))-l) <= compl mass b- 
complement masses(((num stages- mass bitsYft J rr pi - m ass - b » 

compl_accumsav^^^ t0 « num - s *<^ ^ 

strand_ambiguites : for stage in 0 to num_stages-2 generate 

one_wiIdcard : ambigna PORT MAP ( 
^ address(0) => word_stage(( 634< 6 3*( s tage-l) - 9*(((stage-l)*( s tage-l) + (stage- 1))*)) ) 

^ addressd) => word_stage(( 63-K63«(stage-l) - 9*((( s tage-l)*( steg e-l) + (stag e -l))/2)) 

^ address^) => word_stage(( 63+(63*(sta ge -l) . 9*(((stage-l)*(stage-l) + (s toge -I))/2)) 

)+4), addreSS<3) * 63^63*(stage-l) - 9*(((stage-l)*(stage-l) + ( s tage-l))/2)) 

clock => elk, 

elken => enable, 

q(0) => ambig(stage+l) ); 
end generate; ' 



strand_masses : for stage in 0 to num_stages-2 generate 
mlut: masslut PORT MAP ( 



>H), 
)+3), 
)+4), 
)+6), 
)+7), 



address(5) 
address(4) 

address(3) 

address(2) 

address(l) 

address(O) 

clock «= 
enable = 

q 



=■> word stage(( 63+(63*(stage-l) - 9*(((stage-l)*(stage-l) + (staee-nV2Y. Y> 
word_stage(( 63+(63*(stage-l) - V^geV)^^ 

word_stage(( 63+(63*(stage-l) - 9*(((stage-l)*(stage-l) + (stage- 1))/2)) 
word_stage(( 63+(63*(stage-l) - 9*(((stage-l)*(stage-l) + (stage-l))/2)) 
=o word^stage(( 63+(63 '(stage- 1) - 9*(((stage-l)* (s tage-l) + ( S tage-l))/2)) 
«> word__stage(( 63+(63 '(stage- 1) - 9*(((stage-l)*(stage-l) + (stage-l))/2)) 

>clk, 

> enable, 

> mh*out( ((stage*mas S- bits)+mass^bits-l) downto stage*mass_bits) 
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); 

end generate; 



strand_breaks : for stage in 0 to num_stages-2 generate 
civ : cleavecheck PORT MAP ( 



)+l). 
)+3), 

)+4), 
)+7), 

); 

end generate; 



address(5) => word_stage(( 63+(63*(stage-l) 

address(4) => wordjstage(( 63+(63*(stage-l) 

address(3) => word_stage(( 63+(63*(stage-l) 

address(2) => word_stage(( 63+(63*(stage-l) 

address(l) => word_stage(( 63+(63*(stage-l) 

address(0) «> word_stage(( 63+(63*(stage-l) 

clock =o elk, 

enable **> enable, 

q(l) -> sdJnifflCstage+l), 

q(0) => fb_buff(stage+l) 



9*(((stage-l)*(stage-l) + (stage-l))/2)) )), 

- 9*(((stage-l)*(stage-l) + (stage- 1))/2)) 

- 9*(((stage-l)*(stage-l) + (stage-l))/2)) 

- 9*(((stage-l)*(stage-l) + (stage-l))/2)) 

- 9*(((stage-l)*(stage-l) + (stage- 1))/2)) 

- 9*(((stage-l)*(stage-l) + (stage-l))/2)) 



- Now the portmappings for the complementary devices 

compLstr_ambiguites : for stage in 0 to num_stages-2 generate 
one_compl_wild : compl_ambigna PORT MAP ( 

address(0) ~ =>word_stage(( 63+(63*(stage-l) - 9*(((stage-l)*(stage-l) + (stage-l))/2)) 



)+6), 
)+7), 
)+3), 
>+4), 



address(l) 
address(2) 
address(3) 



=>word_stage(( 63+(63*(stage-l) - 9*(((stage-l)*(stage-l) + (stage-l))/2)) 
^word_stage(( 63-K63*(stage-l) - 9*(((stage-l)*(stage-l) + (stage-l))/2)) 
=oword^stage(( 63+(63*(stage-l) - 9*(((stage-l)*(stage-l) + (stage-l))/2)) 



); 

end generate; 



clock «=e> elk, 

elken enable, 

q(0) => compl_ambig(stage+l) 



compl_str_masses : for stage in 0 to num_stages-2 generate 

compI_mhit : compl_masslut PORT MAP ( 
) + 6), address ( 5 > =>word^tage(( 63 + (63*(stage-l) - 9*(((stage-l)*(stage-l) + (stage-l))/2)) 

^ address(4) ^word^ S tage(( 63+(63 '(stage- 1) - 9*(((stage-l)*( st age-l) + (stage- 1))/2)) 

)+3), address P) =>word_stage(( 63+(63*(stage-l) - 9*(((stage-l)*(stage-l) 4- (stage-l))/2)) 

^ address(2) ^ word_stage(( 63+(63*(stage-l) - 9*(((stage-l)*(stage-l) + (stage- 1))/2)) 

address(l) => WO rd_stage(( 63+(63*( S tage-l) - 9*(((stage-l)*(stage-l) + (stage-l))/2)) )), 
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^ address^) => word _ stage(( 63+( 63*(stage-l) - 9*(((stage-instage-l) + (stage- 1))/2)) 

clock => elk, 
enable «=*> enable, 
); q ^^-^-^((^^ 

end generate; 



compLstr_breaks : for stage in 0 to num_stages-2 generate 
compI_clv : compl_cleavecheck PORT MAP ( 
address(5) => word_stage(6), 

-> word^stage(( 63+(63*(stage-l) - 9*(((stage-l)*(stage-l) + 



)+7), 
)+3) 4 
)+4), 

HI), 



address(4) 

address(3) 

address(2) 

address(l) 
address(0) 



); 

end generate; 



clock => dk, 

enable -> enable, 

q(l) => compLsd_buff(stage+l), 

q(0) complJbJmff(stage+l) 



word^stage(( 63+(63*(stage-l) - 9*(((stage-l)*(stage-l) + 

-> word^stage(( 63+(63*(stage-l) - 9*(((stage-l)*(stage-l) + 

word_stage(( 63+(63*(stage-l) - 9*(((stage-l)*(stage-l) + (si 
-> word_stage(( 63+(63*(stage-l) - 9*(((sV-l)*(stege-l) + 



(stage-l))/2)) 

(stage-l))/2)) 

(stage-l))/2)) 

- »D)/2)))), 
(stage-l))/2)) 



process(currState,enable) 
begin 

if enable = T then 

case currState is 



when reset =o 

nextState <= summing; 

when summing => 

nextState o summing; 

when others *=> 

nextState o= reset; 

end case; 



end if; 
end process; 



process(clk,enable,calc_reset,word_stage) 



stage)/2))); 



for stage in -1 to num_stages-3 loop 
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end loop; 

ifcalc_reset = Tthen 

currState <» reset; 
elsif rising_edge(clk) then 

if enable = , l f then 



- Events that occur on every enabled edge 

currState <= nextState- 

- AH for the original (not complementary) strand 

bs_buff(l) <= bs_buff(0); 

bsjjuff(2)<=bsj>ufj{l); 

foIlowing^break<=fb buff' 

sd_bufr2 <= sdjwff; 

start_detected <= sdJ?ufE2; 

discard <= discard J>uff; 

followingLbreakCS) <= followinOreak(7); 

- Same as above but for complementary strand 

compLbs_buff(l)<=compl bs bufftOV 
compJ> s _buff(2) compf bs~buffn v 
compLfollowin^break<-compl ft buffi 
compl_discard <= compl_discard buff- ' 
^PLfonowmgLbreakCS) comp L foll owin£Lbreak(7) . 

if init_ctr 9 then 
rdy <= '1'; 

else 

rdy <= f 0'; 

end if; 



* currState is 
when reset => 

init_ctr <= (others => '0 1 ); 

- All the initializations for the original strand 

wordaccum <= (others => '0 f ); 
accumsave <« (others => 'O; 
word_stage <= (others => *oj ; 
mass_a <= (others => 'O 1 ); 
massjj <= (others => »0 ! ); 
slide_save (others =o '0»); 



slidingwindow <= (others => 'O'V 
startdetected (others => »0'); ' 



— — • — • 

save_b <= (others => 'O'); 
bs_buff<^"100"; 
breakjn_stage <^ (others => 'O'V 
followingLbreak <= (others '0'); 

• All the initializations for the complementary strand 

compl-wordaccum (others '0*); 
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compl_accumsave o= (others «=> H)*); 
compl_mass_a <= (others => '0 1 ); 
compI_mass_b <= (others => '0'); 
compl_slide_save <= (others => '0'); 
compl_slidingwindow <*= (others => f 0'); 
compl_save_J> <= (others => '0'); 
complJ>s_bufF<:= "100"; 
compM>reak_in_s tage <= (others => '0'); 
compljc-llowing^break <= (others => '0*); 

when summing => 

if (init_ctr <= 8) then init_ctr <= ink_ctr + 1 ; end if; 



- The first a-register always gets the mass of the first amino acid in evety word 

mass_a(0 to mass__bits-l) <== mlut_out(mass bits-1 downto 0): 
slide_save(0) <= sdJniffU); *" 
slidingwindow(0 to~mass bits-l)<= (others => »0'V 
bs_buff(0)<-«0'; " h 

-Similar setup for complementary strands 

compLmass_a(0 to mass_bits-l) <= com P I_mlut_out(mass bits-1 downto 0): 
compLshde^save(0)o=compLsd^buff(l); ~ wra "A 

compl_sliduigwindow(0 to mass bits-l)<= (others *=> '0*V 
complJjsJmfflCO)^'©'; 

~ ™'Jf w WOfd h StartS with me m 63 bit word and at every stage it 

~ Presses 9 b,te (one codon - one amino acid) until all 63 bits = 7 amino acids have been 

- processed (both the original and complementary strands use this pipe) 

word_stage(0 to 62) <= ramword(63 downto 1); 
for stage in 0 to numjstages - 3 loop 

end loop; 



" *?T fOT strand - They check everyTta^efor a wild card in the 

~ "'dons (guranteed wildcard) or the specific codons that will create ambteuityTf 

-there is a wildcard in the third position amDiguny it 

for stage in -1 to num_stages-3 loop 
for stage in -1 to num_stages-3 loop 
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OR word ^mH63^T l ?S^^^ Z^^T^ " 9 *«stage*stage + stage)/2))) 
compLambig(stage+2)); . 5 ge + sta Se)/2))) OR ( compI_third^pos^check(stage+2) and 

end loop; 



- Keeps track of which words should not be saved (flushes th7h7^~ ~Z 

discard_buf!(l) wfflSSfSr » wildcard) 

for i in 2 to nutn_stages-l loop ' 

end loop; * (discard^buff(i-l) OR wildcard(M) ); 

if slid ej5 ave(7) = T and (discard_buff(8) « '!•) then 

discard_buff(8) <= '0'- 
else ' 

discarolbuff(8) <= scardj>uff(7); 

end if; 

- Same for the complementary strand 

fori m 2 to 7 loop 

end loop; co ^discard_bu^0<- (co ^ l dkc ^_ bl ^ W) ORcon.pLwildcardCi.l) ); 

if co ra pl_ s lide_save(7) = T and (co mpL discard_bu^8) = T) then 
els com PL^iscard_bufiX8) <= •()•; 

compLdiscard^burT(8) <= compLdiscard^bufF(7); 



end if; 



- starting point was to this word.) ^ " ca * he sa ™d right away (its 

break_in_stage(l) bs_bufS[2) or S d_bufl(l) ; 
fori in 2 to 7 loop 

sd_bufi(i) ; break.in.stageC.) <= (breakta_stage(i-l) OR foUowtog_break(i)) or 

end loop; 

breakjn_stage(8) <- break_in_stage(7); 
- The same for the complementary strand 

complj,^^^^ 

for i in 2 to 7 loop 

compl_followtag_break(i)^ <- (compLbreak_in_ S tage(i-l) OR 

end loop; 

compl_break_ln_ s tage(8) <= compl_break_to_stage(7); 
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— Stuff to deal with the sliding window 

— This is for the original strand 

for i in 1 to 6 loop 

if following^breakO^O' and sdjbuff(i+l) =■<)• then 
t<j . . j v _ stidmgwmdow((i)*massJ>its to (mass_bits)*(i)+(mass bits-1)) <= 

shdingwindow((i-l)*(massjHts) to (massj>its)*(i-l)^massjrits-l)); 

if slide_save(i-l) = T then 

slide_save(i) <=T; 

else 

slide_save(i) <- '0'; 

end if; 

else 

if break_in_stage(i) = '0' then 
slide_save(i) <= T; 

, . _ . slidingwindow( ((mass_bits)*(i)) to ((mass - bits)*(i))+(mass bits-1)) 

<= mass_a( ((massjnts)*(i-l)) to ((mass^b^^O-l^-Kmass^bits-l)); 

else 

. , „. slidingwindow((i)*(mass_bits) to (mass_bits)*(i)+(mass bits-1)) <= 

shdmgwindow((i-l)*(massJ>its) to (mass_bits)*(i-l>f(mass_bits-l)); 

if slide_save(i-l ) = '1 ' then 

slide_save(i)<»T; 

else 

slide_save(i) '0'; 

end if; 

end if; 

end if; 

end loop; 

slide_save(7) <= (slide_save(6) or ( (not save_b(7)) and following; break(8) and 
(break_in„stage(7))))and(notdiscard(8)); *~ W 

— COMPLEMENTARY STRAND 

— Same thing : sliding window for the complementary strand 

for i in 1 to 6 loop 

if compl_followin£J)reak(i>='0 , and compl_sd_buff(i+l) ='0' then 
, t . J1 compLslidingwindow((i)*(mass_bits) to (massjjits)*(i)+(mass bits-1)) <= 

compl_shdingwmdow((i-l)*(mass_bits) to (massj>its)*(i-l)+(massj>its-l)); 

if compl_slide_save(i-l) = T then 

compl_j3lide_save(i) <« •l 1 ; 

else 

compl_slide_save(i) , 0'; 

end if; 
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else 

if compIj5reakJn_stage(i) =» , 0* then 
compl_slide_save(i) <= T; 

if compl_slide_save(i-l) = .'1' then 

compl_slide_save(i) <= T; 

else 

compl_slide_save(i) <= r 0'; 

end if; 

end if; 

end if; 

end loop; 



- ORIGINAL STRAND ~" 

- The following loop detennines when to add or flush the buffers 

- Stuff to deal with the actual summation and sending to scorer 

for i in 1 to 6 loop 

if followingj>reak(i)='0' and sd_buff(i+l) ^O' then 

(((mass.b W (rnass bits-1)) dow^^ ) ~ -^out( 

l))+(mass_bits-l)); " wnassjms) i)) + mass^a((i-l)*(mass_bits) to ((massj>its)*(i- 

savej>(i) «o= '0 1 ; 

else 

(«^s_bits).i)^^^ to (((^-bits^iWmass.bi,,!)) ) mIut _ 0Ut( 

if break Jn_stage(i) = »0' then 
savej>(i) <» , 0'; 

else 

if discard© = '()' then 

save b(i) <« T: 
endif; ~ 

end if; 

end if; 

end loop; 
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if followingJ)reak(7) - W then 
saveJb^^'O 1 ; 

else 

if breakjn_stage(7) =» '0' then 
save_b(7) <= '0'; 

else 

if discard(7) - »0' then 

save_b(7) <=T; 

end if; 

end if; 

end if; 



- COMPLEMENTARY STRAND 

- The logic appears identical, but the mints (the mass lookup tables) have been mapped differently to 

- account for the transposed and complemented nucleic acids within a word 

for i in 1 to 6 loop 

if compLfollowing^breakOHO' and compl_sd_buff(i+l) ='0' then 
, , , „. compl_mass_a(((mass_bits)*i) to (((mass bits)*iVKmass bits-lY> ^ <= 

^r^^ss:^-^ downto ^i^SffSU^ 

compl_save_b(i) <= '0 1 ; 

else 

™™i «w „ u . ixA ^ compl_mass_a(((mass - .bits)*i) to (((mass bits)*i)+(mass bits-1)) ) <« 

compLmlut_out((((rnass_.bits)*0+(niass_bits-l)) downto ((massj>its)*i)); " ' 



ifcompl_break_in_stage(i) = , 0 , then 
compl_saveJ)(i) '0*; 

else 

if compl_discard(i) = '0' then 

compl_save J)(i) <= T; 

end if; 

end if; 

end if; 

end loop; 



if compl_folIowing L ,break(7) = '0' then 
compl_save_b(7) <= '0'; 

else 



if compl_break_in_stage(7) = '0' then 
compl_save_b(7) <= '0'; 



if compi_discard(7) ■= '0* then 

compl save b(7)<='r: 
end if; ~ " 

end if; 
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end if; 



- ORIGINAL STRAND 

- The b registers are sent to scorer and the final accumulator 

- The previous amino acid mass 

mass_b <= mass_a; 

- COMPLEMENTARY STRAND 

compl_mass_b <= compl__mass a; 



— ORIGINAL STRAND 

- word accumulation if a single mass spans more than one word 

if slide_save(6) - T then 

if(discard(8)» , 0')then 

if save_b(7) - f 0 f then 



else 

slidingwbdowtfn^^^ wordaccum + 

end if; 

end if; 

wordaccum <- (others => •()'); 

else 

if(discard(8) = '0 , )then 

(num^stages^nmass^bits^ass^bits-l); <= WOrdaccum + mass.banum^tage^^mass.bits) to 

if following_break(7) * , 0' then 
ifsaveJj^'O'then 

mass^b((num_stages-2)*(mass^bits) to (num.stages-2^^ WOrdaCCUin 

else 

wordaccum <= (others => '0')' 

end if; 



ifsave_b(7) = '0»then 

mass.b((num^stages-2)*^bits) to (num^stages-2)*(mass.bUs^ <= ™*"<™ 

else 

wordaccum <= (others ==> 'O'y 

end if; 

end if; 

end if; 

end if; 
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- COMPLEMENTARY STRAND 

— Same accumulation for the complementary strand 

if compi_slide_save(6) = T then 



if (compl_discard(8) = 'O*) then 

if compl_save_b(7) « ! 0' then 



compl mass b((num stages-2Wmass bit^ t n , com P/- wcu ^ ave ^ compl_wordaccum + 

else 

compLslidingw^^^^ + 



end if; 

compl_wordaccum <= (others => '()')• 
else " 

if (compl_discard(8) = 'O' ) then 

con.pLmassJ^u^^ 

if compl_followmg_break(7) = '0' then 
if compl_save_b(7) = 'O 1 then 

compLmass.bKn^stages-^ass.bits) to (num^es-^^S^bitsT); C ° m '"- wordaccum + 

else 

complwordaccum <= (others => 'O'V 
end if; 71 

else 

if compl_save_b(7) = '0' then 

co m p Lm ass_b((nu m _s te ge S -2)» (ra a S s_bi b ) to i^j^TrS^^^y «^~*««» + 

else 

compI_wordaccum <=• (others => '0'V 
end if; " 

end if; 

end if; 

end if; 



when others => 

end case; 

end if; - for Altera's enable 
end if; 

end process; 

end calc_flow; 
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5. Scoring Unit Controller (scorer. vhd) 

library ieee; 4 ' 

use ieee.stdjogicj 164.all; 
use ieee.std_logic_arith.all; 
use teee.std_logic_signed.all; 



entity scorer is 



port( 



generic( num_stages : integer :~ 10; 

mass_bits : integer := 25; 
tolerance_bits : integer :=» 3; 
num_freq_bits : integer := 8; 
numjbins: integer := 128; 
selected_mass_bits : integer := 9; 
encoder_mass_bits : integer := 7 ); 



); 

end scorer; 



tm3_clkv0 
reset 

MS_input 

score Jm3want 

score_sunready 

score_tm3ready 

score_sunwant 

hitlocation 

scan_complete 

good_match 

compl_good_match 

mem_scanned 

match_address 

memjorjrame 

rreq_product 

num_matches_out 

hist_maxjreq 

compl_freq_product 

compl_num_matches_out 

compl_hist_max_freq 

calc_enable 

freq_enable_signal 

score sent 



: in stdjogic; 
: in std_logic; 

: in std_logic_vector((mass_bits-l) downto OV 
: out stdjogic; 
: in stdjogic; 
: out stdjogic; 
: in stdjogic; 

: out std_logic_vector(18 downto 0); 
: out stdjogic; 

: out std_logic_vector(0 to num_stages-l); 
: out stdjogic_vector(0 to num_stages-l); 
: in std_logic; 

: in std_logic_vector(l 8 downto 0); 

: in std_logic_vector{63 downto 0); 

: out stdjogic_yector(0 to num freq_bits-l); 

: out std_Iogic_vector(7 downtoO); 

: out sMJogic_vector(num_frea_.bits-l downto 0); 

: out std_logic_vector(0 to num_treq_bits-l); 

: out stdJogic_vector(7 downto 0); 

: outstd_logic_vector(num freq_bits-l downto OV 

: rn std_logic; " 

: in std_logic; 

: out stdjogic 



architecture score_struct of scorer is 



- Statistics for low/high frequency mass ranges 

component modjrequency table 

port( 



elk 
rst 
enb 



end component; 



: in stdjogic; 
: in stdjogic; 
: in stdjogic; 
evaluate_mass : in stdjogic; 
maxjreq : in std_logic_vector(0 to 5); 
save_freq : in stdjogic; 

Iow_frecLpeptides : out stdjogic_vector(0 to num stages-lV 
mass valid : in stdjogic_vector(0 to num^tages-f); * h 
matching^stages : in stdjogic_vector(0 to num stages-lV 
hist max_rreq : out std_logicvector(0 to num freqjrits-i); 
Pi J- : out stdjogic_vector(0 to num_freq_bits~l); 

r * nn ™ 1 in stdjogic_vector(0 to (num_stages*7)-l)); 
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- 128 entry RAM Block to store the MS detected values 
component specjvals 
port( 

address: IN stdJogic_VECTOR(8 downto 0V 
clock: IN stdjogic; 

data: IN sMJogic_VECTOR(24 downto 0): 
q: OUT stdJogic_VECTOR(24 downto 0); 
wren: IN stdjogic); 
END component; 



- Fragment Mass Calculator 
component mod__calc 

port( 



downto 0); 

(num_stages)*(massj>its)-l); 
num_stages); 

(num_stages)*(mass_bits)-l ); 



end component; 



elk 

, : in std logic; 

caicreset : in stdjogic; 

enable A : in stdjogic; 

ramword : in stdJogicvector<63 

masses : out stdjogic_vector(0 to 

mass - save : out stdJogic_vector(l to 

complement_masses : out stdjogicvectorfl) to 

compIement_mass_.save: out stdjogic_vector(l to num_stages); 

rdy : out stdjogic); 



iSSSKS? ,0 check how closely * e detected — mateh *• 

port( 

dataa: IN stdJogic_VECT OR(2 downto 0); 
datab: IN stdJogic_VECTOR(2 downto 0); 
clock: IN stdjogic; 
AleB: OUT stdjogic); 
end component; 

- ROMs to help count the total number of matches 

component count_rom 
PORT 
( 

address 
clock 
enable 

); q 

end component; 



: IN STD JLOGIC_VECTOR (7 DOWNTO 0V 
:INSTDJX>GIC; h 
:INSTDJLOGIC :=T; 
: OUT STD_LOGIC_\TECTOR (3 DOWNTO 0) 



(rst > soft_rst,readl_MSl_data,read2 MSI data.imtiali^e^Sl m«« i™a>> is 
score2,compare, done); " ««».miiiaiize,meinJoadl ,memJoad2,mem_save,return__scorel ,return_ 

signal currState : matchStates; 
signal currState J>uffer : matchStates; 
signal nextState : matchStates; 
signal nextStateJwffer : matchStates; 
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: stdjogic; 
: stdjogic; 



stdjogic_vector(0 to 63); 



signal memvar 
signal load_compare 
signal calc_difference 
signal hi 
signal mass_line 
(num_stages)*(mass_bits)-l); 

SS ~rS e = ^--VectoKO U, (nun, stages)^,^,.,); 

signa! compLmasl.saveJine: std_logic_vector(l to ^.i^*"-^ 1 *° num - sta g** 



: stdjogic; 

stdjogic_vector(0 



to 



: stdjogic_vector(0 

: stdjogic_vector(0 to (num_stages)*(mass_bits)-l); 
AJ . . : stdJogic_vector(l to num stages); 

: stdJogic_vector(l to num_stages); ~ 

: stdjogic_vector(tolerance_bits-l downto 0); 



stdjogic_vector(0 



signal freq_massjine 
(num_stages)*(mass_bits)-l); 

signal compl_freq_massJine 
signal freq_mass_save_line 
signal corapI_freq_mass_save_line 

signal userjolerance 

signal pipejmass_line 
(num_stages)*(rnass_bits)-l); 

sipt^^^^^ 

(num_stages)*(mass_bits)-l); ' std_togic_vector(0 

signal pip^compLma^Jine • std_logic_vector(0 to (num.stage^massJntsH); 

signal mass_indexbufferl : std_logic_vector< 0 to (num stagesWencoder mas, hif^ iv 
s,g^mass_mdex_buffer2 :stdjogic_vecto«0to man^^^'mls "oX! 
s,gnal mass_mdex : std_logic_vector( 0 to (num_ s tages)*(enctder_massSS 
signal diff 

((mass_bits)*(num_stages)>l downto 0); 

signal compl__diff 
downto 0); 



to 



to 



signal absdiff 
((raassJ>its)*(mimj>tages)H downto 0); 

signal compLabsdiff 
((massj)its)*(num_stages))-l downto 0); 

- signal good_match : stdjogic_vector(0 to num_stages-l); 



* std_logic_vector( 
: stdJogic_vector( ((massj>its)*(num_stages))-l 

i stdjogic _vector( 
: stdjogic_vector( 



signal spec_mass 
signal s tored_sp ec_mas s 
downto 0); 

^ signal compl_stored_spec_mass 

signal stored_spec mass__reg 
downto 0); 

^ signal compl_stored_spec_rnass_reg 

signal match_ctr 
signal mem_ctr 
signal index 

to((num_stages)*(selected_massJ>its))-l ); 
signal compljndex 

to((num_stages)*(selected_mass_bits))-l ); 
signal ftame__calc_ready ~ 
signal freq_calc_ready 



: stdjogic jvector( (mass_bits-l) downto 0)- 

: stdjogicvector( ((mass_bits)*(num_stages))-l 

dJogic_vector( ((massjrits)*(nuni_stages))-l downto 

: stdJogic_vector( ((massj)its)*(num_stages)>l 
dJogic_vector( ((mass_bits)*(numj?tages)M downto 



: stdJogic_vector(7 downto 0); 
: stdJogic_vector(7 downto 0); 



stdjogic_vector(0 
stdjogic_vector(0 



: stdjogic; 
: stdjogic; 
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signal match found . _,j • 

signal compfmatch.found : MJofr^^?^^**™* <» 

signal numjnatches . 

signal comFLnum.matches : sWJogic.vecto^intoof 6 ^ 7 *™*° ° ): 

signal curr_num match . 

signal cornpLcu^num^match s d^SHS * ^ ownto °>> 

. std_logic_vector( 3 downto 0); 

signal Iow_fi*q_p<£tide S : std_logic_vcctor(0 to num stages-lV 
signal freqtabl^massjine : stdjogic_vector(0 to (num S ta R es*71 IV 

signal freq_en_buff : stdjogic; 
signal save_freq : stdjogic; 
signal evaluate__mass : stdjogic* 

UK r~ Valid : st <Uogic_vector(0 to num stages-1); 

signal maxjfreq : stdjogic_vector(0 to 5)- 

signal pipejowjreq : stdjogic_vector(0 to (num stages*3)-lV 

signal compl^pejowjteq : stdjogic^vector(0 to (nu^n jLg^3)-l); 

signal reg_rreq_enable : stdjogic; 
signal reg_calc_enable : stdjogic; 

attribute synjjlackbox : boolean; 

attribute syn_blackJ>ox of sp ec J>urTer: component is true- 
attnbute syn_black_box of count^rom: component is true! 



hi<=»r ; 

userjolerance <*= "OOl"* 

SS^SSESg ^ fre ^--dy) OR (calc.enable and fr ame_calc ready); 
max_freq <= "011001"[ 

selector_units : for i in 0 to num_stages-l generate 

singlejstagejmffer : specials PORT MAP ( 
n address ^ index( selected.mass^its'i to (se^ted^.bits^^elected^asa^its- 

clock => tm3_clk_v0, 
data => specjnass, 
wren •=> Ioad_compare, 

q => stored.spec.massC (mass.bits^^mass.bits-l) downto mass.bite'i ) 

end generate selector_units; 

complement_selector_units : for i in 0 to num stages-1 generate 
compl_stage_buffer : specials PORT MAP ( 

(selected_mass_bits*i)^Lcted_ma^_bits-l ), COmp,Jndex < selected_mass_bits*i to 
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clock *=> tm3_clk_v0, 

data -> spec_mass, 

wren => load_compare, 

massj>its*i) q => ^PLstored^spec.massC (mas S- bits*i)+(mass_bits-l) downto 

); 

end generate compiement__selector_units; 



freqTable : mod_frequency_table port map( 

elk => tm3_clk_v0, 

rat => reset, 

enb => table_enable, 

evaluate_mass => evaluatejnass, 

maxjxeq => max_freq, 

save jfreq => save_freq, 

low_freq__peptides «> low_freq_peptides, 

mass_valid => freq_mass_valid, 

matching_stages => matchjFound, 

hist_max_freq => hist_max_freq, 

Pi_f^>freq_j)roduct, 

massjanges => freqtablejmassjine ); 

complJreqTable : mod_frequency_table port map( 

elk => tm3_clk_v0, 
rst => reset, 

enb => table_enable, 

evaluate_mass => evaluate_mass, 
maxjreq => max_freq, 
save_treq => save_freq, 

low^freqjeptides => compl_low_freq_peptides, 

mass_yalid => complJreq_mass_valid, 

matching^stages => compl_match_found, 

hist_max_freq => compljiist max_freq, 

Pi_f => compI_rreq_product," 

mass_ranges «> compl_freqtable_mass_line ); 



frame l_calculator : mod_calc port map( 
elk 

calcjeset 
enable 
ramword 
masses 
mass save 



=> tm3_clk_v0, 

==> reset, 

=^> calc_enable, 

^ memJfor_rrame, 
mass_line, 
> mass_save_line, 



complement_masses => compI_mass~line ~ 
complement_mass_save => compi__mass_saveJine, 
rd y trame_calc_ready ); 



freq_calculator : mod_ca!c port map( 
elk 

calc_reset 

enable 

ramword 



mass save 



=> tm3_clk_vO f 

reset, 

=> freqL_enabIe_signal, 

=> mem_for_frame, 
=> treq_mass Jine, 
«> freq_massjsave_line, 
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complementjnasses => compl_freq_massJine, 
complement_mass_save => compLfreq_mass_saveJine, 
y & > rreq_calc_ready ); 

check_difference: for i in 0 to numjstages-l generate 
mass_compare : thresh_comp PORT MAP ( 
dataa => absdiff( (mass_bits*i)+(tolerance_bits»l) downto mass 
datab => userjolerance, 
clock => tm3_clk_v0, 
AleB => lsb3elow_thresh(i) 

)» 

end generate check_difference; 



compl_check_difference: for i in 0 to num_stages-l generate 
compl_mass_compare : thresh_comp PORT MAP ( 

dataa => compLabsdifif( (mass_bits*i)-KtoIerance_bits-l) downto mass bits*i), 

datab ^userjolerance, - h 

clock => tm3_clkv0, 

AleB => compljsbj)elowjhresh(i) 

)» 

end generate compl_check_difference; 



m_counter : countjrom PORT MAP ( 
address => matchjbund, 
clock => tm3 _clk_v0, 
enable => hi, 
q => currjiurn_match 



cm_counter : count rom PORT MAP ( 

address => compl_match_found, 
clock => rm3_clkvO, 
enable => hi, 

q => compl curr_num_match 



begin 



restriction 



load_compare <= '0'; 
calc_difference <= T; 

score_tm3want <= '0'; 
score_tm3ready <= '0'; 
score_sent <»'r; 
scan_complete <= , 0 t ; 



- I'll clock it, the delay is too much (and make sure the freq_enburT gets a max Jan 
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-if fallmgj;dge(freq_enable_signal) then 
-if freq_en_burT - T and fireq_enable_signal * '0' then 
save_freq <= T; 

—else 

save_freq <« '0'; 

--end if; 

case currState is 

when rst => 

nextState <= readl_MSl_data; 
nextState J>uffer <= readl_MSl_data; 

when readl_MSl_data => 
score_sent <= '0'; 
score_tm3want <= T; 

if scorejsunready = T then 

nextState <~read2_MSl_data; ' 
nextStateJjuffer <= read2_MSl_data; 

else 

nextState <^ readl_MSl_data; 
nextStateJbuffer <*= readl_MSl_data; 

end if; 

when read2_MSl_data -> 
score_sent <- 'O 1 ; 
score_tm3want <= '0'; 

if score_sunready = '0* then 

nextState initialize; 
nextState_buffer <*= initialize; 

else 

nextState <= read2_MSl_data; 

nextState_buffer<«read2_MSl data; 
end if; ~ 



when initialize => 

load_compare <= '1'; 
score_sent <= •()'; 

if (match_ctr « "01 111111") then 
nextState <« softjrst; 
nextState J>uffer <= soft_rst; 



else 
end if; 



nextState <= readl_MSl_data; 
nextStateJbuffer <= readl_MSl_data; 



when compare => 



-if (mem_ctr <= 29) then 
if calc_enable = T then 

nextState <= compare; 

nextStateJbuffer compare; 

else 

nextState <?=» return_scorel; 
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nextStateJ)uffer<= return scorel; 
end if; ~ 



when return_scorel => 

score__sent <= '0'; 
score_tm3ready <= T; 

if score_sunwant = T then 

nextState <= return_score2; 
nextState_buffer <=* return_score2; 

else 

nextState o= returnjscorel; 
nextState_burTer<^return scorel: 

end if; 



when return_score2 ^=> 

score_sent <s= '0 f ; 
score_tm3ready <= '0'; 

if score_sunwant = *0' then 

nextState <= soft_rst; 
nextState_buffer <= soft_rst; 

else 

nextState <= return_score2; 
nextState_buffer<= return score2; 

end if; 



when softj-st => 

if calc_enable = T then 

nextState <= compare; 
nextState_buffer <= compare; 

else 

nextState <= softest; 
nextState _buffer <= soft_rst; 

end if; 



when done = 



scan_complete <= 1 1 '; 
nextState <= done; 
nextStatejbufTer <~ done; 



when others => 

nextState <*= rst; 
nextState_bu£fer rst; 



end case; 
end process; 



P rocess(tm3_clk_vO ( reset,fre< L .calc_re^ 



m_scanned) 
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begin 

ifreset=»Tthen 

currState <= rst; 
elsif mem^scanned *> T then 

currState <= done; 
elsif rismgj;dge(tra3_clkv0) then 

mass_index_buffer2 <= massjndex buffer 1; 
mass Jndex <= mass Juadex J)uffer2r 

- register these two so I can pipeline the sig and move it away from the BRAM 

stored_spec_mass_jreg <= storedjspec mass; 
compLstored_s P ec_mass_reg <= com?Lstored_sp ec _rnass; 

- wideor changed 

currState <= nextState; 
currStateJjufifer <^ nextState Jmffer; 

-these two enables have become clocked signals 

tabIe ~ enabI e <= freq_enable_signal OR calc enable- 
evaluate_mass <= calc_enable; 

if rreq_en_buff = T and freq_enable signal = '0' then 
saveJreq<=T; 

else 

save_freq <^ '0'; 

end if; 

freq__en_buff <= freq_enable_signal; 
for i in 0 to num_stages-l loop 

end loop- gO0d ~ match(i) PipeJowficqffl AND matchjfound(i); 

for i in 0 to num_stages-l loop 
compl_match_found(i); <=ompl_good_match(i) <= co mp l_pipe_low_fteq(i) AND 

end loop; 

for i in 0 to 1 loop 

P*eJ W _Wn^^ n^ges^num.stages-l)) <= 

end loop; ~ ' " 

lowjreqjeptides; PipeJ°wJreq(nun._stages*2 to num.stages^-Knum.stages-l)) <= 

for i in 0 to 1 loop 

end loop; ~ //f 

compljow.freqjeptides; ^-PV'Jw.fteqCnum.stages^ to num_stages*2 + („u m _st a ges-l)) <= 
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for i in 0 to num_stages-l loop 

if evaluate_mass - '(T then 

freq_mass_valid(i) <= freqjnass_savejine(i+l)- 
to ((i*massJ>its)+6)); freqtable^massjine(i*7 to (i*7)+6) <= freq_massjine(i*mass^bits 

else 

fi-eq_mass_valid(i) <== mass_savejine(i+l)- 
((iWsJrite)*,)); ~ freqtable^massjine(i*7 to (i*7>+6) massjine(i*mass_bits to 

end if freqtabIe - mass - line ( i * 7 to 0*7)+6) <= mass_index(i*7 to (i*7)+6); 

end loop; 



for i in 0 to num_stages-l loop 

if evaluate_mass = '0' then 

compJ_freq_mass_valid(i) compl_freq_mass_save line(i+l)- 
compl_freqtabIe__massJine(i*7 to (i*7~)+6) <= 



compl_rreq_mass_line(i*mass_bits to ((i*mass_bits)+6)); 

else 



compl_raass_line(i*mass_bits to ((i*mass bits)+6)V 
— FIX ~" 



compl_freq_mass_valid(i) <= compl_mass_save_iine(i+lV 
--compLfreqtable_massJine(i*7 to (i*7)+6) 



(i*7)+6); 



compLfreqtable_massJine(i*7 to (i*7)+6) <= massjndex(i*7 to 



end loop; 



end if; 



if freq_calc_ready = T then 
pipe_mass_line 
pipe_conipl_rnass_Jine 



end if; 



pipe2_mass_Jine 
pipe2_conipl_mass__line 



<= mass_line; 
<= compl_mass_line; 

<== pipe_raass_line; 
pipe_compI_mass_line; 



c ompl_curr_num_match ; 



num_matches_out <= num_matches; 
compl_num_matches_out <= compljaumjtnatches; 

if (&ame_calc_ready = T) and (calc_enable «T) then 

num_matches <= nuinmatches + "0000"+ curr_num_match- 
compLnum^matehes <- compl_num_matches * + 



"0000"+ 



end if; 



for i in 0 to num_stages-l loop 
absdifltfroass bits*i)44) OR ^"SsSSLm ^^^.^3) OR 
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) OR 



compl absdiff((mass bits*i>+9> OR i r»£l.f«, » • ^PLabsdifKdnass.bits*^^ stages) OR 




. „_„ msb_below threshCO <= mat 



aosamumass bits*i)+13) OR absdifnTm*™ i,;***i\Li,iA . «"jmiu^niass_Diis*i>H2) 

abs^masslbits*^ L ) OR absd^ft^ bTte*0+ 17 ^ or S K^rSrlf S8 ~ bitS *^ + 15 
absdifiRCdnass.bits'O+massJits-l) ); a ' ^WInass - b, te 17 ) OR absd^Cmass.bits'iH 18 ) 

compLabsdifK^bits*!)^) Or^KsSi^L^S^R < Com Pj-^5 mass - bi ^) + 3) OR 
compLabsdif^Cmass.bits'i^elected mass WW OR comtf \°£°°W l -*? d W™s S J>te*iy« i ) OR 
compl_absdiff((mass bits*i)+9) OR como iSwJ™ ^ ,ffl[(mass - b,ts *')+nun»_stages) OR 

c°nm L absdiff!^ 

c r pl_absdiff(<mass_bits*i) + 15 ) OR compl S£Zf\ £J£ l^C^Z^^-^^ OR 

match^foimd(i) <= msb__below__thresh(i) AND lsbJ>elowJhresh(i); 
compljsbj>elowjhresh(i); com P- ma Jound(i) <= compLmsb_below_thresh(i) AND 

end loop; 



case currStateJmffer is 

when rst => 

match_ctr <- (others => , 0 I ); 
mem_ctr <= (others => '0'); ' 
diff<= (others => 'O 1 ); 
compl_diff <= (others 'O*); 
absdiff <= (others => '0'); 
compl_absdifT<= (others => 'O'); 

num_raatches <= (others «> '0'); 
compl_num_matches <= (others '0'V 
matchjfound <= (others => '0»); 
compl_matchfound <= (others ==> *&)> 
specjnass <= (others => 'O'); 
index <= (others => *0»); 
compljndex <= (others => '0 1 ); 

„ , . when soft_rst => 

reset all the intermediate accumulators 

match_ctr <= (others => f 0 f ); 
mem_ctr <= (others => '0'); ' 
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diff <= (others => 'l 1 ); 
compl_dirT <= (others «o '(V); 
absdiff <= (others T); 
compLabsdiff <= (others =o '0'); 
num_matches <« (others «=> 'O');' 
compl__num_matches <^= (others 'O'V 
msb_belowjhresh <= (others => f 0'); 
matchjbund <» (others => , 0 t ); 
compl_match_found <= (others' => 'O*)' 
spec_mass <= (others => '0'); ' 
mem_ctr <= (others => , 0 r ); 
hitlocation <= match__address; 
index <= (others => 'O 1 ); 
compHndex <= (others => 'O 1 ); 



when initialize => 

matchjjtr <= match_ctr + 1 ; 
spec_mass <= MS_input; 



for i in 0 to num_stages-l loop 

((selected_mass^bits*i)+(selected mass bitsS^ ) <^ M <5 JE!^-^^ to 
selected^massjrits)); " " )} } MS_input((rnassJnts-l) downto (mass_bits- 

((sel ec^.m^bitsWelected^mass bitsA^f ^1 Ms ^J^^^^ to 
selected_massj>its)); " ' MS_input((mass_bits-l) downto (massjrits- 

end loop; 

when mern_save => 

memvar <« mem_for_rrame; 

when compare => 
mem_ctr <= mem_ctr + 1 ; 
for i in 0 to num_stages-l loop 

stored^spec_mass((mas S _bits*i)+(mass bits-tf*^ d °™ t0 maSS - bit8 * i > <!= 

(mass^bits*i>f(mass,bitill)); ~ J ™ t0 mass Jnts*i ) - pipe2_massjme(massj,its*i to 

. compLstored^spec.massamass bits*i)^ 
pipe2_comp L massJine(mass_bits*iTo (mass^its^mass^bits-l) ) ; mass_bits*i) 

<^(massj,its^^ dow » to massj>its*i ) <~ a bs( 

abs(compLdiff( (massJ>its*i W m^^^^ downto ™ss_bits*i ) <~ 

end loop; 

for i in 0 to num_stages-l loop 

if mass_savejine(i+l) « T then 

((selected_mass^bits*i)+(selected mass bits-lH ^ (selected_mass_bits*i) to 

((mass„bits*i)+selected_mass,bits)-l )f massjine( (massj)its*i) to 

else 
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((selected^^ (selected^mass^bits*i) to 

end if; 

ifcompl_mass_save_line(i-M) = , l , then 

((selected.mass.bits-O+Cselected mass bits-l)) ) COmpl > dex ( (seleo,ed_mass_bits*i) to 

((mass_bits*i)+ S elected_niass_biti)-l )f ' compl_mas S Jme( (mass_bits*i) to 

else 

((selected.mass.bits^^ (selected_mass_bits*i) to 

end if; 

end loop; 

when return_scorel => 
when others => 

end case; 

end if; 
end process; 

end score_struct; 



6. Histogram Architecture (mod_frequency_table.vhd) 

library ieee; 

use ieee.stdJogic_1164.aII; 
use ieee.stdJogic_arith.all; 
use ieee.stdJogic_unsigned.all; 

entity mod_frequency_table is 

generic( num_stages : integer := 10; 

num_rreq_bits : integer := 8; 

size : integer := 8*8 ; 

shift : integer := 8; 

num_bins: integer :- 128 ): 
port( * * 

clk : in stdjogic; 

rat : in stdjogic; 

enb : in stdjogic; 

evaluate_mass : in stdjogic; 

max_freq : i n stdjogic vector(0 to 5V 

savejreq : in stdJogicF 

low_rreq_peptides : out std_logic_vector(0 to num stages-lV 

mass_valid : in std_logic_vector(0 to num itages-1 V 

matchuig_stages : in stdjogic_vector(0 to num stages-1 V 

hist maxjreq : out std_logicvector(0 to num* rreq_bits-l); 

~ ; out stdjogic_vector(0 to num freq_bits-lV 

mass_ranges : in stdjogic_vector(0 to (num_stages*7)-l) 

); 

end mod_frequency_tabIe; 

architecture mod_stats of modjfrequencyjable is 

- decoder to decide which range is being incremented 
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component bin_decoder 
port( 



address: IN stdJogic_VECTOR(6 downto 0V 
clock: IN stdjogic; * 

^^^^^VECTQRCm downto 0); 
clken: IN stdjogic); h 
end component; 



- ROMs to help count the total number of matches 
component count_rom 
port( 

address: IN stdJogic_VECTOR(7 downto 0V 
clock: IN stdjogic; 
enable: IN stdjogic; 

q: OUT stdJogicJvT2CTOR(3 downto 0)); 
end component; 

-check to see if any of the frequency bmTmeet low thresh 
component or_34 
Port( 

elk : in stdjogic; 

arout : out stdjogic); ! " std >* c -Vector(127 downto 0); 

end component; 

- log conversion LUTs 

component logtable 
port( 

A: IN stdJogic_VECTOR(5 downto 0); 
CLK: IN stdjogic; 
QSPO_CE: IN stdjogic; 

end co mp S? : OUT ******** * o»; 



typefreqStetesis(reset > update_stats > locate max freq.rank massesV 
signal currState : freqStates; _««l,ranK_masses), 

signal nextState : freqStates; 

S 5 1U T - fteq : std >g ic - vec 'o'«» to num freq_bits-l); 

signal element_counter : std_logic_vector(6 downto 0V 

signsd h.st_max_freq_reg : stdjogic_vector(0 to num freq bits-lV 

signa^addr:stdjogic_vector(0to(num bins*8MV " 
signal binjner : stdjogic; ™ 

signal flagged granges : std Jogic_vector(0 to (num bins'num stages)- IV 

SS"} p,p f-r ,nass - valid ■• stdjogic_veetor(0 to num staged)- 

signal matchmg_mass : std_logic_vector(0 to num itagLlV 

signal frequeney_p,pelme : stdjogie_vector(0 to (mmf faqtbits^num.stages)-!); 

signal log_accum : std logic; 

i); signal logadder Jipe :ltdjogie_veetor(0 to (num.fteq.bits* (((num.stages-num.stagesHnum stages)*) )- 
signal log_val_stages : stdjogic.vector(0 to (num_stage S *num_freq_bits)-l ); 
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begin 



revjncrementjrange «=» increment ranee • 
folLmaxfreq "00" & maxfreqT ' 



^(((i-W) , ^fa? ! ^ ) : i ^ 8 ^- 1 > " + (i-l)^)) ) + 2) ,o (( sizef(size*(i-l) 

CLK => elk, 

QSPO_CE => evaluate_mass 

QSPO => loa.vaUtag^ Ita^bta to (i-^fl^fc) + (n^j^.,, , 
end generate log^convert; 



range.selectors : for i in 0 to num_stages-l generate 
to(7*i + 6)), 

to (128*i)+I27) 

incrementors : for i in 0 to numj>ins-l generate 



range^decoder : bin_decoder port map( 
address=> mass_ranges( 7*i 

clock elk, 

elken => mass_valid(i), 

q -> mcrement_range(128*i 

); 

end generate range_selectors; 



range>crement_value: count_rom port map ( 

address =>addr(i*8 to (i*8H7) 
clock => elk, 

enable =o binjner, 

q=> mcrement_amount(i*4 to (i*4)+3) 
end generate incrementors; 



good^ranges : for i in 0 to num.stages-l generate 



check_mass_range: or_34 port map ( 
clk=>clk, 

or.out -low_^epti d es((n^ fl e^J- rangeS(i * 128 to * 128 * 127 X 
end generate good_ranges; 
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process(currState,evaIuate rnass,save freq) 
begin ~ " 

binjncr <= '0'; 
case currState is 

when reset => 

nextState o=» update_stats; 

when update_stats => 

bin_incr «=* T; 

ifsaveJreq^Tthen 

nextState o= locate_max_rreq; 

else 

nextState <= update_stats; 

end if; 

when locate_max_freq => 

if element_counter= "1111111" then 
nextState <= rank_raasses; 

else 

nextState <= locate_max_rreq; 

end if; 

when rankjnasses => 

if evaluate_mass = '0' then 

nextState <~ update_stats; 

else 

nextState <= rankjnasses; 

end if; 
when others => 

end case; 

end process; 

process(enb,clk) 
begin 



ifrst = Tthen 

currState <= reset; 
elsif rising_edge(clk) then 

if(enb«T)then 

currState nextState; 
pipe_mass_valid <^ mass_valid; ' 
matching_mass <= matching^stages; 
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Iogadder_pipe <= (others =*• '0*); 



ogadder_p,pe(64 to 119) <= Iogadder _pipe(8 to 63V 
ogadderj pe(120 to 167) logaddeTpip^ to 119V 
ogadder_p,pe(168 to 207) <= logadderjije 128 to 67V 
ogadderj p«*208 to 239) logWjJe 176 to 207V 
ogadder_p,pe(240 to 263) <= Iogadder 216 to 239V 
ogadder_pip e (264 to 279) <= Iogadder jije 248 to 263 •' 
logadder_pipe(280 to 287) Iogadder jije(272 to 279); 

for i in 0 to num_bins-l loop 

& rev.increment.rangea^'nS)) rel\*rem2^ & ™J"crement_range(i + 128) 

end loop; 

for i in 1 to num_stages-2 loop 
(i*num_fteq_bits)+(num freq_bits-l)) ] 0 &-Val_accum(i*num_lreq_bits 

l)W4lbitsW„um-fetbll» + itgvalsJeVln^-i^r-^ 18 to «£ 

(num.freOite-l)); " log_val_stages( (i+l)*num_freq_bits to ((«+l)*num_freq_bits) + 

end loop; 

l)*num^eq_bits W num_^q_bits.l)) <= lo^TaS to < ^stages- 

l)*num^Oits)+(num^aIbits.l)) ; ^ ~ ^ ( num - sta ges-l)*num^freq_bits to ( (numlstages- 

(num_stages-l)*numjfre^^^ <== Io 8- vai - accum C (numjstages-^WiJfreaJrits to ( 

frequency^pipeline <= (others => '0'); 



case (currState) is 

when reset =*> 

frequency <= (others => 'O 1 ); 
bw_freq_range <= (others => '0'); 
frequency_pipeiine <= (others => '0'); 
log_val_accum <« (others => '0'); 
logadder_j)ipe <=» (others => '0'); ' 

when update__stats =o 

hist_max_freq_reg <= (others => '<)'); 
for i in 0 to numj>ins-l loop 

saved_freq <« frequency; 

if evaluate_mass = '0* then 

+ numjreqjrits-l ) <= frequency( i*num frea hit, ^A™* ^f^^Oits to (i*mimjreqj»ts) 

mcremenLamount(i*4to(i*4)+3); ^ num_rreq_bits to (i*numjfreqj 3 i t8 ) + num_freq_bits-l ) + 

log^val_accum <= (others => 'O 1 ); 
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else 



for i in 0 to num_stages-l loop 

saved_Jrequency_table <= frequency; 

end loop; 

frequency «c= (others =o> '0'); 



end if; 

end loop; 
when locate_jtnax_freq => 



hist_max_freq <= hist_max_freq_reg; 
element_counter<- element_counter+l; 

if (saved_freq(0 to numjreqjnts-l) >«'hist_max freqLreg) then 
hist_max_freq_reg <= saved_freq(0 to num_freq_bits-l); 



end if; 

for i in 0 to num_bins-2 loop 

Wnum^freOitsHnum freq_bits-l» ^ffeK' * v s 

((i+l)^num^freOits)^um%q_bits-l) ) ; saved^freq((i+l)*(num_freq_bits) to 

end loop; 

when rank masses => 



temp Jest <=» (others=> 'O'); 
if evaluate_mass = T then 

for i in 0 to num_stages-l loop 
if matching^mass( (num_stages-l) - i) = »1« then 
for j in 0 to nuinbins-1 loop 

« T then if incr ement_range( (i*numj>ins) + j ) 

to (i*num_freq_bits + (num_freq_bits-l)) ) <- "01001 101"; temp_test( i*num_freqjrits 

end if; 

end loop; 

end if; 
end loop; 

end if; 
when others => 



end case; 



end if; 
end if; 



end process; 
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Table 2 Precursor Ion Scan (PIS) Masses 

The following values (in Daltons) were used to obtain the results in Chapter 4. 
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532.1 
534.71 
538.06 


610.1 
610.17 
611.38 


684.01 
684.04 
686.56 


740.7 
741.71 
741.73 


816.34 
816.66 
817.48 


905.75 
905.75 


1021.7 
1022.19 


1090.67 
1091.02 


1214.66 
1218.78 


1386.25 
1392.41 


538.11 
! 547 
547.04 
550.04 


620.66 
621.71 
622.04 
622.18 


687.27 
687.45 
687.62 
687.93 


744.57 
747.8 
757.93 
758 


817.52 
821.31 
821.4 
822.39 


907.44 
917.78 
919.41 
922.41 
922.47 


1022.95 
1023.29 
1024.31 
1028.24 
1032.25 


1093 
1097.96 
1098.55 
1101.42 
1102.21 


1218,79 
1220.15 
1220.98 
1224.15 
1224.6 


1409.32 
1419.51 
1424.64 
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